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Abstract 



Within the framework of equilibrium statistical mechanics the free energy of a phase gives a measure of its associated proba- 
bihstic weight. In order to determine phase boundaries one must then determine the conditions under which the free energy 
difference (FED) between two phases is zero. The underlying complexity usually rules out any analytical approaches to the 
problem, and one must therefore adopt a computational approach. The focus of this thesis is on (Monte Carlo) methodologies 
for FEDs. In order to determine FEDs via Monte Carlo, the simulation must (in principle) be able to visit the regions of 
configuration space associated with both phases in a single simulation. Generally however one finds that these regions are 
significantly dissimilar, and are separated by an intermediate region of configuration space of intrinsically low probability, so 
that a simulation initiated in either of the phases will tend to remain in that phase. This effect is generally referred to as the 
overlap problem and is the most significant obstacle that one faces in the task of estimating FEDs. 

In chapter|2]we start by formulating the FED problem using the Phase Mapping (PM) technique of Q]- This technique 
allows one to circumvent the intermediate regions of configuration space altogether by mapping configurations of one phase 
directly onto those of the other. Despite the improvement that one gets when formulating the problem via the PM, the overlap 
problem persists, albeit to a lesser degree. In chapter|2]we define precisely what we mean by overlap and then discuss a range 
of methods that are available to us for calculating the FEDs within the PM formalism. In the subsequent chapters we then 
focus on the three generic strategies that arise in addressing the overlap problem. 

The first strategy that we focus on (chapter|3} is that of the representation. This corresponds to the choice of coordinate 
system with which one parameterises the degrees of freedom of the two phases. The PM works by matching these coordinates 
in the two systems, and therefore its efficiency in generating overlap is dependent on the choice of representation. We examine 
a particular representation in which the PM matches the fourier coordinates of the two phases in such a way so as to achieve 
perfect overlap in the harmonic limit (for structurally ordered phases). Previous formulations Q] have been limited to the real- 
space PM (RSM). By comparing the RSM to the fourier version (FSM), we show that for a range of temperatures the overlap 
associated with the FSM is considerably better than that of the RSM, thus allowing one to determine the FED efficiently under 
conditions in which the RSM would otherwise fail. 

The second strategy that we study (in chapter |3 is that of the estimator which one uses to determine the free energy 
difference. The different estimators pool the data from the various regions of configurations space in different ways, and 
therefore have different (systematic and statistical) errors associated with them. As a consequence the severity of the overlap 
problem is dependent on the estimator that one employs. We examine conditions under which the different estimators are able 
to arrive at estimates which are free of systematic errors. 
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Generally however, the scope for refinement of the representation and the estimator is limited. In chapter|5]we deal with 
the third and final strategy which appears in the FED problem; the refinement of the sampling strategy. This involves the 
construction of a sampling distribution which explores regions of configurations space outside those typical of the two phases, 
and thereby engineers overlap. In particular we examine three strategies. The first is the Multicanonical strategy, and involves 
the introduction of corrections to the configurational energy appearing in the Metropolis acceptance probabilities, so as to 
force the simulation outside the regions associated with the corresponding phase. The second that we examine, and which has 
been developed here for the first time, is the Multihamiltonian strategy. This involves the independent simulation of several 
systems, which overlap in the regions of configuration space that they explore, in a way which allows one to determine the 
FED. The advantage of this method is that it is highly parallelizable in a way that is not possible for the other two methods. 
The third and final method that we study is the Fast Growth method, and involves performing non-equilibrium work on the 
system so as to force it from the regions of configuration space associated with one phase to those of the other We demonstrate 
that all three methods are effective in overcoming the overlap problem. 

The discussion until this point will be limited to the task of determining phase boundaries within the purely classical 
framework. At low temperatures, however, quantum effects begin to become increasingly important; this is especially true for 
particles with light masses. Generally there are two types of quantum effects that arise. The first are quantum discretisation 
effects, which arise when the typical energy is of the order to the phonon excitation energies. The second effect is that of 
quantum exchange, which arises when the de Broglie wavelength becomes roughly of the order of the interatomic spacing. 
In the final part of this thesis (chapter|5Jl we address the additional phenomena which arise from the quantum discretisation 
effects by generalising the PM formalism so as to be applicable within the path integral formulation of statistical mechanics. 
The path integral approach allows one to obtain thermodynamic information of a quantum system by modelling a classical 
system in which the interacting particles are replaced by interacting polymers. The simulation of such a system expends 
considerably more computational effort than its classical counterpart, and as a result makes the calculation of the quantum 
FED considerably more difficult. We get around this problem by using the parallelizability of the Multihamiltonian method 
to divide the computation of the quantum FED amongst several processors in a parallel cluster. This allows us to examine the 
importance of zero point motion on the quantum Lennard- Jones phase diagram. 
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List of Symbols 

• = : equality up to a normalisation constant which is not known a-priori. 

• < Q >TT '. this denotes the expectation of a macrovariable Q with respect to the sampling distribution tt (see Eq. I1.30t . 

• 7 : this variable denotes the phase label. The two possible values for 7 are 7 = A and — B. 

• (3 = where k is the Boltzmann constant 

• AAj = Aj+i — \i. This is the increment in the field parameter For all the simulations employed in this thesis, the 
increments were the same, so that AAj = AA (see section l2'.4.4l and section l2'.4.8> . 

• AT : this corresponds to the time (in the FG method) for which the system is equilibrated, before work is subsequently 
performed on it. 

• A-y (r) : variable which keeps track of the phase. It assumes the value unity if r corresponds to a configuration generated 
in phase 7 and zero if not. In order for this function to be able to work, one must (in the most general case) keep track 
of the phase label by appeal to the reference configuration (Eq. 12.11 Eq. I2.6l l about which the particles are displaced. 

• A-y[il/] : variable which assumes the value unity if Afe[M]^ and zero otherwise. In the case where [M]a and [M]b 
do not overlap (i.e. when M is an order parameter for the two systems), then this function may be used to identify the 
phases. 

• A : the field parameter used to morph the configurational energy of one phase onto that of the other phase. See Eq. 12.481 
for a simple example of this. 

• \q : See Eg. 16.221 

• 7r'^(r) : the Boltzmann sampling distribution. Eg. 11.261 

• 7r!^(r) : the Boltzmann sampling distribution when constrained to phase 7 (see Eq. I1.46> . Since a phase essentially 
corresponds to a local basin of attraction in the configurational energy E{r), this distribution may be realised by imple- 
menting a simulation initiated in phase 7. The local structure of the configuration space will ensure that the simulation 
will remain in that phase. 

• 7r^j!(V) : the canonical MH sampling distribution given by Eq. 15.201 in terms of the collective configuration of the 
composite (multi-replica) system. 

• T^PS — T^psi'^' 7) • canonical PS sampling distribution (Eq. I2.74> in terms of the effective configuration v of the 
system, where the phase label is a stochastic variable. 

• 7r™(v) : the superscript m denotes a MUCA samphng distribution. 

• 7r^({u}) : the quantum sampling distribution of phase 7 (see Eg. l6.71> . 
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• T : the effective temperature in the quantum system (Eq. I6.I6I 1. 

• A(x) : the metropoHs acceptance function, Eq. 11.291 

• D : global translation vector appearing in the PM formulation. Figure [T31 Eg. 12.31 

• D : The De Boer parameter (Eq. 16.371 1 appears in quantum Lennard- Jones systems and fixes the temperatures at which 
the different quantum effects become important (see also appendix|HJ. 

• E{r) : the configurational energy of the system. 

• (v) : the configurational energy of phase 7 in the v representation, Eq. 12.81 

• AE'^j^ : the difference in energies between the reference configuration of phase B and that of phase A (see Eq. 12.1 1> . 

• el^ : the i'th component of the j 'th eigenvector of the dynamical matrix K-y of phase 7. 

• £7(v) : difference between the actual energy of the system and the energy of the reference configuration (Eq. 12. 9> . 
In the case of the reference configuration being the ground state (as it is in the case of crystalline solids, where the 
reference configurations correspond to the lattice sites), (v) corresponds to the excitation energy of the system. We 
will frequently refer to this simply as the 'configurational energy'. 

• £!j(v) : the harmonic contribution to the excitation energy of phase 7 (see Eq. I3.7l and Eq. I3.14> . 

• £°(v) : the anharmonic contribution to the excitation energy of phase 7. 

9 E\ : a configurational energy which is a function of the field parameter A. One may use this function to construct a 
chain of configurational energies (see Eq. I2.44> which links the configurational energy associated with phase A (Ea) to 
that associated with phase B (£b)- The most straightforward parameterisation, one which we will employ, is the linear 
one ofEa. l248l 

• : the configurational energy of the quantum system. 

• : the absolute free energy of phase 7, Eq. 11.141 

• AFba = Fb-Fa 

• AFg^ : the anharmonic contribution to the FED. 

• H{MBA,i\'^) ■ the histogram recording the number of times a data output of the simulation falls in bin MBA,i under an 
experiment performed with the sampling distribution tt. 

• Jf-y! In the MH method this corresponds to the configurational energy 'associated' with the extended system (see Eq. 

KT5\ . 

• In the quantum case this is used to refer to the configurational energy of the (classical) polymeric system repre- 
senting phase 7 (see Eq. I6.26> . 
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• H : denotes the hamiltonian operator of the (quantum) system. 

• K.J : the dynamical matrix of phase 7 (see Eq. I3.8I I. 

• kl^ : the eigenvalue corresponding to the eigenvector el^ . 

• [M]^ : the set of macrostates consistent with phase 7. Note, however, that when one performs a simulation constrained 
to phase 7 (via 7r^(r)), the simulation will only visit a subset of [M]j. 

• Mj^ : the macro variable in Eq. 12.151 

• A'Ibaa '■ bin i in Mba space. (i=l,2,..,b where b = number of bins). 

• MBA{i) ■ the i'th output of Mba during the course of the simulation. (i=l,2,...,t), t being the final output of the 
simulation. 

• T]{MBAa) ■ the MUCA weights. 

• 77^*^ : the weight associated with sub-ensemble i for the simulated tempering method (see Eq. I2.58t . Note that unlike 
the multicanonical weights ri{MBA,i) these are not functions defined on configuration space. 

• N : number of particles 

• O: the overlap parameter, Eq. 12.181 This variable assumes the value unity if there is perfect overlap between the two 
phase constrained distributions and if there is no overlap. 

• P : the number of rephcas in the polymeric system modelling the quantum phase (see Eq. \6.25l . 

• y : the permutation operator 

• P(r) : absolute canonical probability of observing a configuration r (Eq. ll.U . 

• P{M) : absolute canonical probability of observing a macrostate M (Eq. I1.3> . 

• P(r I7) : absolute canonical probability of observing a configuration r conditional on being in phase 7 (Eq. II. 5> . 

• P{M\'-f) : absolute canonical probability of observing a macrostate M conditional on being in phase 7 (Eq. \l.6l . 

• P{M, 7) : the joint probability of observing a macrostate M and being in phase 7 rEa. ll.l2V 

• P{MBA\Ti) : the distribution of Mba at timeslice i (when the configurational energy E\ has been changed from £.x-_^ 
to £ Ai , and after the system has been equilibrated for a time AT). 

• P{WBA\'^'y) '■ the estimator (see Eq. 14. 6t of P{Wba\t^^) for the 7^7 FG process. In terms of notation, this is 
equivalent to P{Wba\Cj^7)- 

• J'iy^~({v}) : this denotes the probability of obtaining a path {v} in the 7 ^ 7 FG process, as described in section 
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• r : 3N dimensional column vector denoting the positions of all the particles. 

• Rba ■ the ratio of the absolute partition functions, Eq. 11.81 

• ^"BA ■ the ratio of the partition functions as given in Eq. 12.131 

• ^ BA,p ■ the ratio of the partition functions of the P replica polymeric system representing the quantum phase (see Eq. 
K64\ . 

• R.^ ; a reference configuration in phase 7. For crystalline structures, an appropriate reference configuration is the 
underlying lattice structure, corresponding to the (classical) ground state configuration. 

• Sba ■ the linear transformation used to map the displacements u of phase A onto those of phase B, Eq. 12.141 This 
mapping ensures that the two phases share the same v coordinates. 

• St '■ the total action of the classical polymeric system modelling the quantum phase (Eg. 16.281 . 

• Sk ■ the terms in the total action St which contain information relating to the kinetic properties of the quantum system 
(Eg. 1091. 

• Sv '■ the terms in the total action St which contain information relating to the configurational properties of the quantum 
system (Eq. I6.30> . 

• T : temperature of the heat bath 

• T* : effective temperature appearing in the Lennard- Jones system (Eq. 13. 5> . 

• : the linear transformation which relates the displacements u to the effective configuration v of phase 7, Eq. 12.51 

• T : denotes the kinetic energy operator (Eq. I6.12> of the (quantum) system. 

• u : the displacement of the particles about the reference configuration R-^. 

• V : the effective configuration. These are generahsed coordinates which may be used to parameterise the configuration 
space of the system. When the distinction between the configuration space as described by the r coordinates and that 
described by the v coordinates is necessary, we will refer to the space spanned by the variables r as the absolute 
configuration space, and those spanned by the variables v as the effective configuration space. 

• V : volume of the system. 

• WniWBA)' This measures the contribution of the macrostate Wba to the numerator of the corresponding estimator. 
For the general DP estimator this given by Eq. 14.101 whereas for the EP estimator it is given by Eq. 14.141 and for the 
PS estimator it is given by Eq. 14.481 

• WdiWBA)'- This measures the contribution of the macrostate Wba to the denominator of the corresponding estimator. 
For the general DP estimator this given by Eq. 14.1 II whereas for the EP estimator it is given by Eq. 14.151 and for the 
PS estimator it is given by Eq. 14.491 y 
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• SWba,! ■ the (temperature scaled) work incurred in incrementing the configurational energy from £x. to Ea^+i whilst 
keeping the configuration constant (see Eq. I2.93> . 

• Wba '■ the net (temperature scaled) work (which we will simply refer to as work) appearing in the FG method, obtained 
on changing the configurational energy £>, from £^ to £3 through a series of steps in which at each stage one increments 
A and then equilibrates the system with the new configurational energy for a time AT (see section lT^.SI for details). 

• Wm- the (reversible) work obtained in the limit of thermodynamic integration (see section l2'.4.4l Eq. 12.521 Eq. I5.29> . 
This is also the point at which the two phase constrained distributions P{Wba\t^a) -PIM^baItt^) intersect (see 
figure l4!2t so that Wm = — In JI-b a- 

• utn ■ the n-th cumulant (see Eq. I3.24^ of the probability distribution P{Mjj\Tr^). 

• C7^7 • this is used to denote the 7 — > 7 FG process, as described in section U!TSl 

• Z : the absolute partition function, Eq. 11.21 

• : the absolute partition function of phase 7, Eq. 11.71 

• Z\. : the partition function associated with the configurational energy E\. (see Eq.|^^}. 
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List of Acronyms 

• AR : acceptance ratio method denotes the estimator of the FED in which one performs two independent simulations, 
one in each phase, and estimates the expectations of the acceptance probabilities (see Eq. 12.311 or more generally Eq. 

• DP : dual phase. This refers to the most general canonical perturbation formula (see Eq. I2.34l i. 

• EP : this refers to the exponential perturbation estimator (Eq. I2.28> of JI'ba in which the FED is estimated from data 
extracted from a simulation constrained to a single phase. 

• FED : free energy difference, Eq. 11.151 Eq. 11.161 Eq. 12.101 Since the problem of estimating JI-ba is equivalent to that 
of estimating the FED of the two phases, we will frequently interchange the use of the terms JI-ba and FED. 

• FF : this refers to the fermi function estimator corresponding to Eq. 14.281 The optimal C is obtained by recursively 
solving Eq. lOD and Eq. l432l 

• EG : the fast growth method process (see section l2'.4.8ll5.4l l. 

• FSM : the fourier space mapping is a particular realisation of the general phase mapping (Eq. I2.14> in which the fourier 
coordinates of one phase are mapped onto those of the other phase (see Eq. I3.16t . 

• HOA : higher order approximation. 

• LJ: Lennard Jones (refers to the pairwise interactomic potential given in Eq. 13. 2> . 

• MH : the multi-hamiltonian strategy is an extended sampling strategy which involves the construction of several inde- 
pendent but overlapping distributions. These overlapping distribution then allow the construction of a path linking the 
typical macrostates of the two phases (see section lSJt . 

• MS : the multistage strategy is similar in principle to the MH methods (see section l2'.4.4> . 

• MH-PS : the PS method, as implemented within the framework of the MH extended sampling strategy (see section l53t . 

• MUCA : multicanonical (see sections l2A3l 12.4.71 15^ . 

• NVT : this refers to a system whose volume V and temperature T are maintained at a constant value and in which the 
number of particles within the system remains unchanged during the course of the simulation. Such a system is referred 
to as a canonical system and has a distribution given by the Boltzmann distribution (Eq. II. U . 

• PA : primitive approximation 

• PM : 'phase mapping' refers to the scheme whereby the configurations of one phase are mapped onto those of the other 
phase. The employment of a phase mapping allows one to map the problem of estimating the FED between the two 
phases onto that of estimating the FED between two systems with different configurational energies (see Eq. I2.13> . 
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• PS : the 'phase switch' should not be confused with the PM. This corresponding to a simulation in which attempts to 
switch phases are actually made, and whose corresponding estimator for JI-ba essentially amounts to measuring the 
(unbiased) ratio of the times spent in the two phases (see Eq. I2.78> . See chapter|4]for generalisations of this method. 

• Q-FSM : quantum fourier space mapping. 

• Q-RSM : quantum real space mapping. 

• REP : restricted exponential perturbation formula (see Eq. 14. 3t . 

• RDP : restricted dual phase perturbation formula (Eq. 14. 3t . 

• RSM : the real space mapping is a particular realisation of the PM (see Eq. \2Ai . 

• ST ; simulated tempering (see section l2'.4.6> . 

• WHAM : the weighted histogram analysis method (see section l2'.4.5> . 
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Glossary 

• order parameter : macrovariable which assumes a different ranges of values in the different phases. These ranges are, 
by definition, non-overlapping 

• A : this is the phase label denoting the fee structure. 

• B : this is the phase label denoting the hep structure. 

• configuration space : this term is used both to refer to the space spanned by r and that spanned by v. When the 
distinction between the configuration space as described by the r coordinates and that described by the v coordinates is 
necessary, we will refer to the space spanned by the variables r as the absolute configuration space, and that spanned by 
the V coordinates as effective configuration space. 

• canonical : in the context of sampling, this refers to sampling from a Boltzmann distribution (see Eq. I1.26> . 

• conjugate phase : this corresponds to the phase which the simulation is currently not in and the phase onto which the 
configurations are being mapped. In the case of the phase switching method this changes during the course of the 
simulation. In the case of a phase constrained simulation this remains the same for the entire duration of the method. 

• dual phase : this refers to estimators of the form of Eq. 12.1051 which explicitly involve the accumulation of data from 
two simulations, one constrained to each phase. 

• extended sampling strategy : this refers to the procedure whereby the sampling distribution is made to encompasses a 
wider (or extended) region of configuration space than is typically associated with the canonical distribution, which the 
expectations are performed with respect to. The desired expectations are recovered from Eq. 11.321 

• macrostate : this corresponds to the collection of configurations which yield a particular value for a given macrovariable. 
(See also Eq. ll.3l for the probability of observing a given macrostate). 

• parent phase : this refers to the phase which the simulation is currently in and from which the configurations are being 
mapped onto the other phase. In the case of the phase switching method this changes during the course of the simulation. 
In the case of a phase constrained simulation this remains the same for the entire duration of the method. 

• partial overlap : see note 1 2 1 

• path : this refers to a sequence of (closely spaced) macrostates which are actually sampled during the course of a 
simulation and which connect the regions of (effective) configuration space associated with one phase to those associated 
with the other phase. 

• representation : this refers to the particular way in which one expresses the degrees of freedom (v) of the phase (see Eq. 
12. 8t . Since the PM matches the v coordinates of the two phases, the representation directly affects the overlap obtained 
under the operation of the PM. 
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• system : We will frequently interchange this word with the word "phase" 

• thermodynamic Umit : Umit of the system size tending to infinity. That is N 



Chapter 1 



Introduction 



1 .1 Phases and their stability 

The material world around us comprises of matter and its interactions. Depending on the strengths and ranges of these 
interactions matter, on the macroscopic scale, displays a variety of collective properties. These collective properties result in 
the formation of different "phases" of matter such as gas, liquid, and solid. For these phases there are two levels of description, 
which are the microscopic and macroscopic approaches. The microscopic picture describes matter in terms of its constituent 
particles and their interactions, whereas the macroscopic description coarse grains the configurational and kinetic information 
of the constituent particles into a small set of so called macrovariables. 

In the case of equihbrium ll3l-||6| these macrovariables fluctuate in time about a mean which remains constant in time, and 
the corresponding theory that describes the interrelation of the means of these variables is thermodynamics. The fundamental 
parameters which enter into the theory are certain macrovariables (such as the configurational energy E), certain parameters 
called intensive variables (such as the temperature and the chemical potential) which do not explicitly depend on the system 
and instead describe the coupling of the system with the environment, and the concept of entropy, which is a measure of the 
amount of disorder present in the system. 

Thermodynamics is useful in that it explains the interrelation amongst some of the most important macrovariables. How- 
ever, the theory does not give one the power of being able to predict how these macrovariables (or more precisely the means 
of these macrovariables) vary as one changes the intensive parameters. In order to do this one must resort to the microscopic 
description of the phenomena. A full blown microscopic approach would be (in the classical case) to solve Newton's equations 
for the particles and then average the relevant macrovariables over sufficiently long times, or (in the quantum case), to solve 
the multi-particle Schrodinger equation and evaluate the time averages of the expectation of the relevant macrovariables. Such 
an approach is, however, analytically intractable and one must instead resort to a more approximate microscopic approach. 

The relevant microscopic theory is that of statistical mechanics, a theory in which all temporal effects have been averaged 
out. The core ingredients of the theory are the set of spatial configurations which the system may assume, the set of intensive 
variables which describe the system-environment coupling, and the configurational energy of the system. Using these, one 
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may then construct probabilities for finding the system in a given configuration at any given instant of time. Since the theory 
is independent of kinetics, a considerably reduced amount of effort is required in describing phenomena. For a more in-depth 
development of the points mentioned above, the reader is referred to some standard texts on statistical mechanics Pl-fTl. 

In order to describe phenomena directly via statistical mechanics we must first explain more precisely what exactly we 
mean by a phase. Within the framework of statistical mechanics a phase corresponds to the group of microscopic configura- 
tions in which the constituent members of any given group exhibit some common property unique to that phase. For example, 
in the case of a crystalline solid phase, the associated group would correspond to all configurations in which the particles are 
displaced by "small" amounts about some lattice structure. This lattice structure, which is the common characteristic of all 
the configurations, is what identifies the group and different lattice structures yield different groups or different phases. By 
grouping the configurations in this way one may also calculate the probability associated with a phase simply by summing 
the probabihties of the constituent configurations. The result is proportional to a quantity called the partition function of the 
phase, which plays a central role in statistical mechanics. 

The properties of a phase can, within the framework of thermodynamics, be predicted through a central quantity called the 
free energy. On the other hand all such predictions will, within the scheme of statistical mechanics, stem from the probability 
distribution of the configurations associated with that given phase. Not surprisingly it turns out that the free energy of a phase 
is intimately related to the partition function of that phase, with the intensive variables being the common parameters in the 
two theories. By finding the partition function of a phase, one is able to predict its behaviour quantitatively in the macroscopic 
limit. 

Of all the predictions that statistical mechanics can make, we shall focus on one. Namely, given a set of candidate phases, 
which phase is the one that is actually going to be found in nature for a given set of constraints (of the environment on the 
system)? Within the framework of statistical mechanics this translates to the task of finding out which is the most probable 
phase, or correspondingly finding out which phase has the largest partition function. In the context of determining phase 
boundaries, where one is trying to determine the more probable out of two candidate phases, one may reduce to number 
of calculations by merely focusing on the ratio of the partition functions (which entails a single calculation) as opposed to 
focusing ones efforts on the calculation of the individual partition functions (an approach which will require two separate 
calculations). 

The analytic evaluation of the ratio of partition functions (or equivalently the free energy difference) is, however, no simple 
task. Despite the fact that temporal effects have been averaged out in statistical mechanics so as to considerably simplify the 
theory, it turns out that, for most complex systems of interest, calculation of the desired properties via analytic techniques 
remains intractable. One instead has to resort to computation. However even within the framework of this approach, the task 
of determining the ratio of the partition functions still remains a difficult one. In the rest of this thesis we will primarily be 
concerned with the investigation and development of computational methods of determining the FED (or equivalently the ratio 
of the partition functions) of two different phases. 

In the next section we will introduce the necessary machinery which will enable us to define exactly what we mean by the 
partition function of a phase. We will express the partition function of a phase as a multidimensional integral and illustrate 
how the ratio of the partition functions can be thought of as the ratio of two multidimensional integrals in which the regions 
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which contribute the most come from two non-overlapping regions of the space over which the integrals are defined. We will 
then briefly discuss the Metropolis algorithm, which is a widely used computational technique to model equilibrium phase 
behaviour This will be followed by a discussion of how the method may, in principle, be used to estimate the FED, and how 
in practice it fails. 



1.2 Statistical mechanics : The formulation of the problem 
1 .2.1 Key concepts and definitions 

In this section we develop the necessary statistical mechanical theory (8). Suppose that we have a system with a fixed number 
of particles N, at a fixed volume V, and at a fixed temperature T. Such a system is referred to as a canonical or N VT system. Let 
r denote the 3N dimensional column vector containing the positions of all the particles. Within the framework of equilibrium 
statistical mechanics it follows that, when the dynamics of the system have been averaged out over sufficiently long periods 
of time, the canonical probability of finding the system assuming a configuration r is given by the Boltzmann distribution: 

P(r) - |e-^^W (1.1) 
where E{r) is the configurational energy of the system and Z is the absolute partition function: 

[ dre-^'^W (1.2) 



and /3 — l/kT, where k is the Boltzmann constant. If we have a variable AI ~ Af (r) which is a function on configuration 
space, which we call a macrovariable, it follows that the canonical probabihty of M assuming a value AI* (called a macrostate) 
is then given by: 

P{M*)= J drS{M{r) - M*)P{r) (1.3) 

A macrostate is essentially a collection of microscopic configurations {r} for which a macrovariable assumes a particular 
value. The canonical probabihty of a macrostate (Eq. \l.3l accounts for the fact that there may be a multiplicity of microscopic 
configurations associated with a given macrostate. 

Equipped with the armoury of the probabilities of microstates (Eq. II. H and of macrostates (Eq. II. 3> . we may now 
proceed to define a phase. We first note that in thermodynamics one generally identifies a phase (which we label 7) through 
a macrovariable AI , also called an order parameter, which spans a set of values [AI]^. For an order parameter the set of 
values (say [AI]^ and [Af];^) associated with the two different phases (7 and 7) do not overlap, allowing it to be used as an 
identifying variable for the phases in question. Carrying this idea over into statistical mechanics, one may construct a criterion 
for deciding whether a configuration r belongs to a phase or not by virtue of the following function: 

, 1 : ifAf(r)e[M]^ 
: otherwise 
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If [r] is 1, then the configuration r belongs to phase 7, otherwise it does not. This function essentially uses the property of 
M being an order parameter (that is assuming a unique set of values in the different phases) in order to determine whether a 
configuration belongs to a phase or not. 

Using Eq. 1 1.41 one may immediately write down the partition function and the (canonical) conditional probabilities of 
finding a configuration r and that of finding a macrostate A/*, in phase 7: 



P(r|7) - ^e-'^^MA.W 



Z. 



(1.5) 



and 



P{M*\j) = J dr6{M(r) - M*)P{r\j) 

= — f dr(5(M(r)-M*)e-'3^WA.^[r] 
J 



(1.6) 



where Zj denotes the partition function, or probabilistic weight, associated with phase 7: 



Z-t^ J dre-'^^('")A^[r] (1.7) 

Having now defined the concept of a phase and its corresponding weight (the partition function, Eq. II. 7> we may now write 
the ratio of the partition functions of the two different phases as the ratio of two multidimensional integrals: 



Rba 



< ArW > 



(1.8) 



< AaM > 

where the angular brackets <> denote an expectation with respect to the distribution P(r) (defined more explicitly in Eq. 
ILSOI below). Alternatively, by using the order parameter M one may write the ratio of the partition functions as the ratio of 
two one dimensional integrals: 



where 



/ dM/\B[M]P{M) 
J dMAA[M]P{M) 

< Ab[M] > 

< Aa[M] > 



(1.9) 



A^[M] 



(1.10) 



1 : if Me[M]^ 
: otherwise 

The strategy of re-writing Rba as has been done in Eq. I1.9l is a highly advantageous one since it reduces the multidimensional 
problem in Eq. ll.Sl to the one dimensional problem of Eq. 11.91 It crucially depends on ones ability to find a suitable order 
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parameter M, which may not be possible for smaller systems. In such situations one may instead have to be content with 
a macrovariable M which spans an overlapping range of values ([Af]-y and [M]^) in the two phases. In this case one must 
distinguish the two phases on a microscopic level. For example, in the case of crystalline phases, one may do this by keeping 
track of the lattice vectors about which the particles of the system are displaced. In this more general case Eq. 11.51 Eq. 11.71 
and Eq. ll.Sl will continue to hold provided is more broadly defined as: 

f 1 : ifreir}^ 

A^[r]^<^ ' (1.11) 

I : otherwise 

where {r}^ denotes the set of configurations which one would typically associate with phase 7. In the case where [M]^ and 
[M] B partially overlap, the expression in Eq. ll.9l for Rba no longer holds, and must instead be expressed in terms of the joint 
probability distribution of M and 7: 



P(M*,7) = P(M*|7).P(7) 

drS{M{r) - M*)e-^^("")A^[r] (1.12) 



where [r] is now given by Eq. 11.111 It then follows that the ratio of the partition functions may now be expressed more 
generally as: 



Rb/ 



J dMAB[M]P{M, B) 

(1.13) 



/ dMAA[M]P{M, A) 
J dMP{M, B) 



J dMP{M, A) 

It is clear that in Eq. I1.13l the macrovariable M is in fact a redundant variable. Its utility, however, lies in the estimation of 
Eq. I1.13l via simulations, where the macrovariable M is used to guide the simulation to certain regions of configuration space 
. We will have more to say about this in section |24| (in particular sections l2.4.3l and l2.4.7k 

1 .2.2 The link to thermodynamics 

In order to establish the connection between statistical mechanics and thermodynamics we first note that, in statistical me- 
chanics, questions as to the relative stability of phases may be entirely addressed through the quantity Rba^ (Eq. 11.81 Eq. 11.91 
and Eq. 11.131 ). If this quantity is greater than unity, phase B is the more stable. Otherwise phase A is the more stable of the 
two. Thermodynamics, on the other hand, extracts the corresponding information through the free energy (F^) of the phase. 
The phase which has the lower free energy is the more stable of the two. The identity which bridges the two theories is the 
following: 



(1.14) 
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It follows from Eq. II. 141 that the ratio of the partition functions is intimately related to the FED of the two phases: 



Rba = e 



(1.15) 



where Fba is the free energy difference: 



Fba = Fb-Fa 



(1.16) 



From thermodynamics we know that the equilibrium phase (that is the one which is found in nature, subject to the necessary 
constraints) is the one with the minimum free energy. This is consistent with the statistical mechanical formulation since from 
Eq. 1 1.1 41 this merely corresponds to the phase with the maximum probabilistic weight Z^. Furthermore since the free energy, 
and hence the free energy difference, is an extensive quantity (that is AFba N), it follows from Eq. I1.15l that in the limit of 
— > oo (called the thermodynamic limit) the difference in partition functions of the two phases will magnify so as to make 
one of the phases overwhelmingly more probable than the other This is in line with the thermodynamic observation of there 
being only one phase that is consistent with the constraints imposed on the system |9l. 

1.2.3 Summary 

Summarising, if one has two candidate phases, and one wants to find out which will appear in nature, one can construct a finite 
system and calculate the ratio of the partition functions Rba via Eq. 11.81 Eq. 11.131 Eq. 11.91 This allows one to determine the 
more stable (or more probable) of the two phases. It then follows that in the thermodynamic limit this phase then becomes 
overwhelmingly more probable that the other and as a result will be the one found in nature |10|. 

For most interesting systems, even for a finite system the underlying complexity rules out any analytic approach. One must 
instead resort to computational techniques. The Monte Carlo method is a computational approach which is particularly suited 
for the simulation of equilibrium systems in which one is not concerned with the dynamics but merely static, time averaged 
quantities. In the next section we will briefly introduce a particular type of Monte Carlo method, called the Metropolis 
algorithm, and then discuss in section [r.3.2l how this method may, in principle, be used to tackle the problem of estimating the 
ratio of the partition functions Rba- 

1.3 Simulation tools 

1.3.1 The Metropolis algorithm 

I . 3.1 .1 Constructing the method 

There are two main simulations techniques which are employed to sample configuration space distributed according to Eq. 

II. 11 II II . 1121 . The first is molecular dynamics, a method which we do not employ in this thesis. For further information 
refer to 1131 . The second, and more natural (in the context of equilibrium statistical mechanics) is the Metropolis algorithm 
1141 . 1151 . Unlike molecular dynamics, in which the dynamics is purely deterministic, the Metropolis algorithm is a purely 
probabiUstic method. We will now describe the method in some detail. 
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The Metropolis algorithm works by generating a sequence of configurations {r(l), r(2), r(t)} in which the probability 
of generating a configuration r{t + 1) is only dependent on the current configuration r{t). This algorithm may be constructed 
in such a way so as to ensure that in the infinite time limit {t oo), the relative probabilities of configurations appearing in 
the chain satisfy any arbitrary sampling distribution 7r(r). To see how this is done consider the rate equation for 7r(r): 



97r(r) 



= J dr'Psir' ^ r).7r(r') - ^(r). J dr'Ps{r ^ r') (1.17) 

where Ps(r' r) denotes the transition (or sampling) probability of the algorithm from a configuration r' to a configuration 
r. If the transition probabiUty of the algorithm is to yield a process with a stationary distribution (that is a distribution 7r(r) 
which does not change in time) one must have: 

^ = (1.18) 

ot 

Clearly one way, but by no means the only way, in which Eq. ll.lSl mav be satisfied is by assuming that: 



Ps{r' ^ r)^(r') - Ps{r ^ r')^(r) (1.19) 

The constraint on Ps (r' r) in Eq. I1.19l is called the condition of detailed balance and is used by the Metropolis algorithm 
in order to produce a chain of configurations in which different configurations appear with relative frequencies which are 
consistent with 7r(r) fl^ l. 

In the Metropolis algorithm the procedure of sampling is divided into two stages. The first stage involves generating a new 
configuration r' given a current configuration r. The second stage is that of accepting or rejecting the proposed moves. Let 
Pq (r' |r) denote the probability of generating r' given r, and let Pa (r r') denote the corresponding acceptance probability. 
It follows that the sampling probability may be written as: 

P5(r^r') = PG(r'|r)Pa(r^r') (1.20) 
Using Ea. ll.l9l and ll.20l it is easy to show that: 



Pajr^r') ^ TT{r')PG{v\r') 
Pair' ^ r) ^(r)FG(r'|r) 

Using this one may easily verify that a suitable Pa{r — > r') is of the form 1141 . 1151 . 1171 : 



(1.21) 



P.(r^O^Min{l /f);:f'-^ (1.22) 
7r(r)FG(i"'|r) 



Eq. I1.22l is called the Metropolis acceptance criterion. An alternative, which also satisfies Eq. 11.211 is given by: 

Pa(r^r') = , , „ , (1.23) 

^ ' T , 7r(r)PG(r'|r) ^ ^ 

^ 7r(r')PG(r|r') 

In the simulation of statistical mechanical systems a particular case of Eq. I1.22l is generally adopted. Consider a simulation 
performed via the MetropoUs algorithm in which the generation of a trial configuration involves perturbing a randomly chosen 
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particle to a random position chosen to lie within a specified volume 1181 about the particle's initial point 1131 . For such an 
algorithm, the probability of generating a new configuration r', given a current configuration r, is symmetrical in the following 
way: 

PG(r'|r) -PG(r|r') (1.24) 

As a result Eq. I1.22l simplifies to: 

P,(r->r') =Min{l,^} (1.25) 
7r(r) 

For the particular case where the sampling distribution is the Boltzmann distribution (Eq. II. 1> : 

TT = 7r'==e-'5^('-) (1.26) 
where = denotes an equality up to a normalisation constant which is not known fl71, Eq. I1.25l mav be written as: 

Pair ^ r') = A{(3AE) (1.27) 

where 

AE = E{r') - E{r) (1.28) 

and 

^(x) = Min{l,e"^'} (1.29) 



is the Metropolis acceptance function. The procedure of employing a PG(r'|r) with the property given in Eq. 1 1.241 and the 
acceptance probability Pa{r r') given in Ea. ll.27l forms the cornerstone of the original Metropolis method, and will be the 
one that is used in the canonical simulations (Eq. II. U performed in this thesis. 

Summarising, if one performs a simulation in which one stochastically generates configurations and accepts via the accep- 
tance probabilities of Eq. 11.221 one generates a chain of configurations in which the frequencies of the appearance of different 
configurations are proportional to their probabilities 7r(r). We will now show how this property may be used to estimate the 
expectation of macrovariables. 

1 .3.1 .2 Estimating the expectation of macrovariables 

Suppose now that one wants to evaluate the expectation of some function Q of a macrovariable M with respect to the sampling 
distribution tt: 

_ /dr.(r)Q(M(r)) 
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By using the Metropolis algorithm (that is the stochastic algorithm in which proposed moves are accepted via eq. 11.221 1. one 
may estimate the expectation < Q via the following scheme: 



1 * 

= -J2QiM{^)) (1.31) 

where M{i) denotes the i-th output of the macrostate M by the simulation and where H{Mi\7r) denotes the histogram count 
for bin Mi under a sampling experiment performed via the sampling distribution tt. It is important to keep in mind that it 
is the lack of knowledge of the normalising constant of vr which necessitates the inclusion of the integral / (ir7r(r) in the 
denominator of Eq. ll.30lllT9l . This will have important consequences for the task of estimating the FEDs (see section l2'.4.3> . 

A finite sample estimate given in Eq. I1.31l will generally have a statistical error associated with it. This arises from the 
fact that one is trying to reconstruct the relevant probabihty distribution from a finite number of samples, or equivalently from 
a finite time simulation II20I . In the case of FED calculations, one has, in addition to this, systematic errors. These arise (in the 
context of FED calculations) from not sampling the regions of configuration space which contribute the most significantly to 
the relevant estimator. Once again this arises from the fact that one is running the simulation for a finite amount of time. The 
differences in the two types of errors lie in the time scales needed to reduce the error to an acceptable level, and therefore in 
some circumstances the distinction can become blurred. One may generally think of statistical errors as those which may be 
decreased to a desired level merely by running the simulation for sufficiently long times, where the lengths of time in question 
are generally those for which one would be prepared to run a simulation. In the case of systematic errors, the times needed to 
reduce them to an acceptable level are generally considerably greater (by at least several orders) than one would be prepared 
to wait. The methods which are successful in estimating the FEDs are those which overcome such systematic errors. We will 
have more to say about the way in which they do this in chapter|2]and chapter|5] 

Eq. I1.31l tells us how we may estimate the expectation of a macrovariable (with respect to a sampling distribution tt) based 
on an experiment performed with the same sampling distribution . More generally one may need to estimate the expectation 
of Q with respect to a distribution (say tt) which is different from the sampling distribution tt used to obtain the data. To do 
this we simply re-write the expectation < Q as an expectation with respect to the sampling distribution tt: 



<Q>i 



J drn{r)Q{M{r)) 



1,f/'r.V, 



Jdrn{v)Q{M{r)m 



^ 7r(r) 

where, as is the case in Eq. 11.301 the need to evaluate the denominator of Eq. I1.32l essentiallv arises from the lack of knowledge 
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of the relative normalisation constants of tt and n. Provided is well defined Eq. I1.32l mav be estimated by: 

<-^>^- t #(r(»)) ^^■^^> 

Z^i=l 7r(r(i)) 

where {r(l), r(2), r{t)} denotes the sequence of configurations generated by the simulation and where M{i) — M{r{i)). 
In the special case where is a function f{M{r)) of M{r), that is: 

7r(r) 

we may write Eq. I1.32l as: 



-/(M(r)) (1.34) 



<Q{M)fiM)>. 

^ < /(M) ^'-^^^ 

and we may re-write the corresponding estimator (Eq. I1.33> as: 

Eti/(M.)i/(M,|7r) 

Eq. I1.33l and Eq. I1.36l plav a central role in the task of estimating FEDs via computational techniques. We will see in sections 
12.41 and more clearly in chapter|5j that at the heart of all the methods designed to tackle the problem of estimating FEDs is 
the construction of a sampling distribution tt which differs from the distribution tt with respect to which the expectations are 
performed. We will refer to this as the extended sampling (ES) strategy \21]. We will have more to say about these extended 
sampling strategies in section l2'.4.3l and chapter|5] Before doing this we will describe three general techniques which may be 
used to estimate Eq. II. 81 via simulation and will then proceed to focus on one of these, namely the phase mapping method. 
In the next chapter we will then proceed to review the various methods that are available for estimating FEDs within the 
framework of this method. 



1.3.2 Sampling strategies for estimating Rba 

Broadly speaking there are (for NVT systems) three generic strategies which one may pursue in order to estimate Rba 1221 . 
They are the reference state technique, the continuous phase technique, and the phase mapping (PM) technique. At the heart 
of all the techniques is the concept of a path, which we define to be a series of overlapping macrostates (obtained during the 
course of a simulation) connecting the regions of configuration space associated with one phase to those associated with the 
other. 

In the reference state technique, a path is constructed which connects each phase to a reference system for which the 
partition function is known exactly. In this way one is able to estimate the absolute partition function of each phase. In the 
continuous path technique, a continuous path is constructed from one phase to the other, thereby allowing one to estimate the 
ratio of the partition functions. In the phase mapping (PM) technique, a path linking the two phases is constructed in which 
one "leaps" directly from one phase to the other, omitting all the regions of configuration space lying in between the two 
phases. We will now review these methods in greater detail. 
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1 .3.2.1 Reference State Technique 

In the reference state technique (also called thermodynamic integration) 1231 . 1241 . 1125 1 , the basic idea is to construct a path 
which connects the desired phase to a reference system for which the partition function is known exactly. This allows one 
to compute the FED between the given phase and the reference system. By performing two such simulations, connecting 
each phase to an appropriate reference system 1261 . one may infer the FEDs between the phases and their respective reference 
systems. Since the partition functions of these reference systems are known a-priori, one may use these results to determine 
the absolute values of the partition functions of each phase fTTj . One may then proceed to determine which is the more stable 
phase of the two. A schematic is shown in figure [l~T] 

Technically, the way in which one links the desired system to the reference phase is as follows. One constructs a config- 
urational energy E\ (r) in which the field parameter A assumes any value between and 1 . Furthermore suppose that at the 
extremities of A = and A = 1 this configurational energy assumes the form of the configurational energies of the reference 
and desired phases respectively (i.e. i?A=o ~ E,. and i?A=i = Ej, where and Ej are the configurational energies of the 
reference and desired phases respectively). Then the fundamental relation upon which the method is based is: 



=0 dX 

dE 



< ^ dX (1.37) 

A=o dX 



where tta is given by: 



7rA(r)=e-'3^^W (1.38) 

Eq. I1.37l must be estimated numerically by first dividing up the interval {A} into a discrete set 
{Ai = 0, A2, A„ = 1}, and then proceeding to estimate Eg. II. 371 via: 

Fa=i - Fx=o =■ ^< dE7/dX>^,^ (A,+i - A,) (1.39) 

i=l 

where < dE\/dX >Tr^. denotes an estimate of the expectation < dE\/dX >tt^ .- 

Generally there will be two sources of error in the estimator of Eq. 11.391 The first will be statistical errors in estimating 
< dE\/dX >Tr^. and the second will be systematic errors arising from the discretisation of the interval {A} 1281 . Whereas 
the statistical errors are made smaller simply by increasing the duration of the simulations, the systematic errors can be made 
smaller by reducing the size of the increments A^+i — A^. By ensuring that the increments A^+i — Ai are sufficiently small, 
one may ensure that the systematic errors are smaller than the corresponding statistical errors, though a-priori it is not clear 
how small these increments have to be in order to ensure that this is indeed the case. 

The presence of systematic errors is one point which makes the reference state technique, as formulated here, a slightly 
unattractive one. Furthermore the reference state technique requires the estimate of two separate quantities, that is the partition 
functions of the individual phases, when one is in fact only interested in the single quantity corresponding to the ratio of these 
quantities. Clearly a single calculation which directly estimates this ratio would be preferable l29l . 
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phase A reference system B 




reference system A 

phase B 



Figure 1.1: Reference State Technique 

In the reference state technique one performs a separate simulation for each phase, in which the simulation connects the phase 
to a reference system for which the partition function is known exactly. Knowledge of the absolute value of the partition 
function of the reference system and the ratio of the partition functions of the phase and the reference system (which is 
estimated from the simulations) allows one to estimate the absolute value of the partition function of each phase. One can 
then determine the more probable of the two phases by noting which of the two has the larger partition function (or smaller 
free energy). 
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1.3.2.2 Continuous Path Technique 



In the continuous path technique 1301 . one performs a simulation which travels from one phase to the other via a continuous 
path, hi order to estimate the ratio of the partition functions (or equivalently the FEDs) of the two phases via Eq. 11.81 Eq. 11.91 
one performs a simulation in which one keeps track of the order parameter M. Using Eq. ll.31l one may then estimate Rba 
from Eq. I1.9l via the identity: 

< Ab[M] > e.b. EliAi3[Af(»)] 
- < A.[M] > - El,A.[M(.)] ^'-''^ 

where A-^ [M] is given by Eq. 11.101 

In the case where the set of values [AI]a and [M]b overlap (that is when AI is no longer strictly an order parameter), 
one must instead use Eq. 11.13! In this case the macro variable M becomes redundant, and one instead estimates Rba via the 
identity: 

^ e^b.ELA^ (1.41) 

EliAA[r(z)] 



where A-y[r(i)] is given by Eg. II. 11! A schematic is shown in figure [l~2l 

There are three problems with the method as it stands. The first is the fact that the estimators in Eq. II .40! and Eq. 11.411 
will generally fail. The reason for this lies in the fact that for straightforward Boltzmann sampling, in which one samples 
according Eq. 11.261 one generates configurations whose frequencies of appearance are in accordance with their canonical 
probabilities. Since the two phases are separated by a region of configuration space characterised by macrostates [M]i of 
extremely low probability (see figure II. 2> the probability of the simulation generating a sequence of configurations which 
traverses this region will be vanishingly small. As a result the simulation will remain stuck in one of the phases, making it 
impossible to estimate Rba from eauation ll .40l or [l~4n since either the numerator or the denominator of these estimators wiU 
be zero. This problem, which is called the overlap problem and is the origin of the systematic errors we were alluding to in 
section [T. 3. II may be overcome with the adoption of appropriate extended sampling strategies 1301 . We will have more to say 
about this in section l2!4lll3Tl . 

The second problem, which is a problem afflicting the case of (structurally) ordered phases, arises from the fact that in the 
process of going from one phase to the other (via [A/]/ in figure [OJ the simulation will in general have to traverse through 
regions of configuration space which are characterised by mixed-phase or disordered configurations. That is the transition 
from one phase to the other will involve the disassembling of a phase, followed by the organisational restructuring resulting 
in the assembling of the other phase. This will result in the formation of a defect-rich final structure in the case where one of 
the phases is a crystalline solid. As a result one will not obtain a correct estimate for Rba f32T. Note that even though the 
first problem, that is the problem of interphase traverse, may be overcome by the use of extended sampling (see for example 
II30I '). this second problem will continue to persist in the case of ordered phases [33 1. 

The third problem, though not as serious as the previous two, arises from the fact that the regions of configurations space 
\M]i which one needs to traverse in going from one phase to the other do not actually contribute to Rba- As a result, the 
simulation will waste large amounts of times in regions of configuration space which do not actually contribute to the final 
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phase A 




Figure 1.2: Continuous Patti Teclnnique 
In this method the simulation traverses between the two phases via a continuous path which involves crossing through regions 
of configuration space which are characterised by mixed phase-configurations. These configurations do not contribute to the 
relevant Rba and are characterised by defect-rich structures. 



estimate of Rba- 

1.3.2.3 Phase Mapping Technique 

A method which overcomes the last two problems of the continuous path technique, namely that of sampling the regions of 
configuration space [M]i characterised by mixed phase-configurations and that of sampling regions of configuration space 
which do not contribute directly to Rba, is the phase mapping technique (PM) (T], I34I - I37I (see also 1381 . |139D. This 
method avoids the intermediate regions (characterised by the set of macrostates [M]i in figure [OJ altogether by using a 
global configuration space shift 113 81 to map configurations of one phase onto those of the other phase. Suppose that one is in 
phase A, with a configuration r. The basic idea is to find a constant displacement D such that r+ D denotes a characteristic 
configuration of phase B (see figure[0}. The result is a mapping of the configurations of one phase onto those of the other 
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via the operation: 

r^r + D (1.42) 
In order to make use of this mapping one re-writes Eq. ll.Sl as: 

_ /dre-^^(.-+D)A^[Af(r + D)] 

- Jdre-mr)AA[Mir)] ^^'^^^ 



By writing this as: 



Eq. I1.43l mav be written as: 



" A^[M(r)] 

= <e-'5[^(-^+°)-^WlAB[Af(r + D)] >,c^ (1.45) 
where tt^ denotes that the expectation is performed with respect to configurations constrained to phase A II40I : 



= P(r|7) (1.46) 
From Eq. I1.31l it follows that Rba may then be estimated via: 

e.b. E-=ie-''[^('"«+°'-^('"«^lAs[M(r(i)+D)] 

^BA - 

= i^e-'^[^('-«+°)-^('-«)lAB[Af(r(i)+D)] (1.47) 

where the data is obtained from a simulation constrained to phase A. Since Eq. 1 1.471 is essentially an estimator involving a 
sampling distribution constrained to a given phase, we will refer to Eq. I1.47l as a phase-constrained estimator For future use, 
we will refer to the phase which the simulation is actually in as the parent phase (phase A, in the case of Eq. I1.47> . and the 
phase onto which the configurations are being mapped as the conjugate phase (which in this case is phase B). Eq. I1.47l is just 
one example of a phase-constrained estimator. Looking forward, we note that these estimators may be most generally written 
of the form of Eq. 1231 

The expression in Eq. 11.451 is essentially a way of estimating FEDs by performing a simulation in a single phase. By 
employing a global configuration space shift D, one is able to bypass the intermediate regions of configuration space region 
characterised by the set of macrostates [M]i in figure [O] The core idea behind the method is to find a global configuration 
displacement D which will allow one to sample the configurations important to Zb as well as those important to Za in a 
single simulation performed in phase A. 
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The problem with the method as it stands is that it is not sufficient that one finds a D such that the macrostates [M]^ 
of the parent phase (A) are mapped onto those, [Afjs, of the conjugate phase (B). In order for Eq. 11.471 to serve as an 
efficient estimator of Rba, one must ensure that the configurations of probable macrostates in [M]a are mapped onto the 
configurations of probable macrostates in [M]b. Generally one will not be able find a suitable D which ensures that this is 
the case. One will instead find that configurations of probable macrostates belonging to [M]a are mapped onto configurations 
of improbable macrostates of This is another form of the overlap problem and results, in a way that will be explained 

in much greater detail in section |231 in the failure of Eq. ll.47l to serve as an efficient estimator for Rba- However since we 
avoid the region of configuration space characterised by the macrostates [M]/, the magnitude of the overlap problem that we 
face in dealing with Ea. ll.47l is considerably less than that of estimating Rba via the continuous path technique ('Ea. ll.4lt . 

1.4 Summary 

In seeking to model the equilibrium behaviour of bulk material in terms of its constituent components a useful microscopic 
theory is that of statistical mechanics. This theory works by associating a probability with each configuration that the system 
may assume. Accordingly one may associate a probability with a phase simply by summing up all the probabilities of the 
configurations consistent with that phase. The net result is proportional to a quantity called the partition function Z^, given by 
Eq.O 

For a given set of external constraints, one finds that for many non-trivial (finite) systems several candidate structures may 
be stable. Each of these structures, or phases, will have an associated probability (proportional to their respective partition 
functions Z~^). As the size of the system increases one of the phases becomes overwhelmingly more probable than the others, 
resulting in that phase being the one that is found in nature. 

In the case of finding phase boundaries, one is merely interested in determining the more probable out of two candidate 
structures Ii41ii . Therefore it suffices to concentrate ones efforts on the determination of Rba rather than the individual 
partition functions themselves. By using the Metropolis algorithm one may, in principle, estimate Rba by taking the ratio of 
the times spent in the two phases (Eq. II. 8t . In practice however, a Metropolis algorithm which generates (via the sampling 
distribution tt!^, Eq. I1.46t macrostates according to their canonical probabilities (Eq. II. 6> will remain trapped in the phase in 
which it is initiated, resulting in the systematic errors alluded to in section [T. 3. II Though this problem may be overcome by 
appealing to an appropriate extended sampling strategy, the transition of the simulation through the intermediate regions [A/]/ 
results (in the case of transitions to an ordered phase) in the formation of defect-rich structures. One way around this to use 
the reference state technique. An alternative is the PM technique, in which one maps configurations of one phase directly onto 
those of the other phase (see figure [T3i . This in principle allows one to estimate Rba from a simulation performed in a single 
phase via Eq. 11.471 In practice however even this method fails (in a way that will be described in greater detail in section l23t 
due to a milder version of the original overlap problem, which, in the context of the PM method, translates to configurations 
of probable macrostates of [M]a being mapped onto configurations of improbable macrostates of [M]b under the operation 
of the PM. As is the case with the continuous path technique this overlap problem may be overcome with the aid of a suitably 
refined extended sampling strategy (see section l2'.4. 3112.4.71 and chapter|5j- 
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phase A 




Figure 1 .3: PM technique for a simulation initiated in pinase A 
By employing a global configuration space displacement vector D, one may map the configurations of phase A directly onto 
those of phase B. The benefit of this is that it allows one to sit in the parent phase (A) and calculate Rba simply by mapping 
the configurations onto those of the conjugate phase (B). This operation does not require one to traverse the intermediate 
region of configuration space characterised by the macrostates [M]j. However one finds that for most systems of interest, the 
configurations of typical (or probable) macrostates of phase A are mapped onto the configurations of a-typical macrostates 
(macrostates of low probability) of phase B. This is again another manifestation of the overlap problem (see section l23l , and 
results in a poor estimate of Rba via Eq. 11.471 



CHAPTER 1. INTRODUCTION 



36 



The focus of this thesis will be on the development of methodologies of tackling the problem of estimating FEDs via 
the PM formalism, and will ultimately lead to their application to the calculation of quantum FEDs. In the next section we 
will start off by mapping (by using the framework of the PM method) the problem of determining the ratio of the partition 
functions given in Eq. ll.Sl onto that of evaluating the ratio of the partition functions associated with two systems with different 
configurational energies. This will allow us to present the overlap problem in a quantitative fashion. We will then review 
the array of methods that have been engineered over the last 30 years in order to address this sort of computational problem, 
before proceeding to discuss our own investigations on the problem. 



Chapter 2 



Review 



2.1 Introduction 

In this section we will formulate the task of estimating the FED within the PM formalism, thus mapping the problem of 
evaluating the ratio of two integrals involving a single configurational energy (see Eq. II. 8> onto that of determining the ratio 
of integrals involving two different configurational energies. We will proceed to define what we mean by overlap and then 
discuss the overlap problem in the context of the PM formalism. Following this we will review an array of methods which 
have been designed to tackle the problem of evaluating the ratio of integrals involving two different configurational energies. 
Some of what is presented here is new (in particular the (unifying) formulation of the estimators in terms of the macrovariable 
Mba and the process switching generahsation of the fast growth method, section lZ4.9> . We include it here for the sake of 
providing a coherent presentation. 

2.2 Formulation of the problem 

Suppose that R-y denotes a reference configuration that is consistent with phase 7. One may then express the position vector 
r in terms of the displacements u about R-,,: 

r = R^ + u (2.1) 
The degrees of freedom may now be parameterised through the variable u (instead of r) so that Eq. ll.Sl mav be written as: 

J-g-/3£;(RB+u)^^ 

where the Jacobian of the transformation from the variable r to u is unity. By expressing the degrees of freedom in Eq. I2.2l in 
terms of the variables u, we have effectively switched from the r representation (see Eq. II. 8t to the u representation. 

By comparing Eq. l2.2l to Eq. 11.431 we immediately see that the expression for Rba in Eq. l2.2l involves a PM in which the 
configuration of phase A is mapped onto a configuration of phase B via the global configuration space displacement 
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D = Rs (2.3) 

We will refer to this mapping, whereby configurations of one phase are mapped onto those of the other via the global configu- 
ration space displacement of Eq. I2.3l and in which the displacements of the particles from their lattice sites are matched in the 
two phases, as the real-space mapping (RSM) lO, I34I - I37I . It is realised via the following operation: 

R^,u^R^,u (2.4) 

More generally, one may formulate the problem in an arbitrary representation. Consider writing the displacement u as a linear 
transformation of some generalised coordinates v (which we will call the effective configuration of the system): 



(2.5) 



so that: 



r = R.^ + T^v (2.6) 

We may now use the effective configuration v to parameterise the degrees of freedom of the phase. Substituting Eq. l2.6l into 
Eq. I1.7l we find that: 

= det J dve-'^^^(^) (2.7) 

where 

£;^(v) = £;(R^ + T^v) (2.8) 
If we express the configurational energy about that of the reference configuration: 

E^{v) ^ E{Il^) + E^{^r) (2.9) 
then the free energy difference between the two phases may be written as: 

AFba = AE°sA - P'^ In det Sba ~ In^lsyi (2.10) 

where 

AE%A = E{IiB)-E{IiA) (2.11) 
is the contribution of the reference state configurations to the FED, while 



Sea — Tb.T^^ 



(2.12) 
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with 

Zb /dve-'3£«(") 

By comparing Eq. l2.13l to Eq. 12.21 we see that the expression for JI-ba in Eq. l2.13l now involves a PM in which the effective 
configuration v is preserved on the transition from one phase to the other; in other words the PM matches the v coordinates 
of the two phases. This mapping, in which the coordinates r-^ of phase 7 are mapped onto the coordinates r^;, of phase 7 (such 
that they share the same effective configuration v) may be reaHsed in u space via the operation: 



R^, u ^ R;y, S^y^u (2.14) 

where S;^^ is given by Eq. 12.121 The quantity 51-^ a in Eq. 12. 131 is ubiquitous in a variety of fields. It is also the starting 
point for a string of literature concerned with the task of estimating the FEDs ID, 1381 . I43I - I55I (for reviews see jsj, II13I . 
II22I . |^3|, I56I - I61I V We note that whereas Eq. ll.Sl is the ratio of two integrals involving a single configurational energy, 
Eq. I2.13l is the ratio of two integrals with different configurational energies. In both cases the regions which contribute most 
significantly to the two integrals come from different regions of configuration space. However in the case of Eq. 12321 the 
disparity that exists between these two regions of (effective) configuration space may not be as great as it is for the two regions 
of (absolute) configuration space within the original r formulation (see Eg. ll.St . We illustrate this idea schematically in figure 

EH 

Despite this scope for improvement, the overlap problem, albeit a milder version, generally persists. In figure 0(b) this 
corresponds to the fact that the most typical regions associated with phase A do not overlap with the most typical regions 
associated with phase B. In the following section we will explain exactly why this poses a problem for the task of estimating 
the FED. In section we will then proceed to review the methods that have been developed in order to estimate quantities 
of the form of Eg. 12. 131 



2.3 The overlap problem 

In order to discuss the overlap problem in a semi-quantitative way, let us first define a quantity which measures the energy 
cost in switching from a configuration in phase 7 to a configuration in phase 7 via the mapping of Eq. 12.141 

M^^(v) =/3{£^(v)-£^(v)} (2.15) 

Following the earlier definition of the conditional probabihty of a macrovariable (Eq. II. 6> . one may define the probability of 
observing a macrostate Mg^ conditional on the sampling distribution of phase 7, tt^, as: 

P{M*sA\n;) = J S{Mba{^) - Af^^)7r^(v)dv (2.16) 
We will refer to this as the phase constrained distribution of phase 7. 
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a) r representation b) v representation 




Figure 2.1 : Schematic of phases in different representations 
In the original r representation, the two phases correspond to two non-overlapping regions of (absolute) configuration space. 
Schematically this is shown in (a) by two widely separated regions of (absolute) configuration space with the intermediate 
regions denoted by I. In the v representation (of the PM formulation) the degrees of freedom of the phases are parameterised 
through the effective configuration v (see Eq. 12. 6> . If a good choice of representation (i.e. R and v) is made then the 
distinction between the two phases becomes considerably less in the 'effective' configuration space (see (b)) than it is in the 
original r representation (see (a)). 
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In order to understand the behaviour of the phase-constrained distribution ^(A/baI'''^) let us analyse the behaviour of 
the macrovariable Mjj. This macrovariable is under certain circumstances an order parameter for the system. To see this let 
us first consider a hard sphere system. For this type of system a mapping of the form of Eq. 12. 141 will map a configuration 
of phase 7 (for which the hard spheres do not overlap) onto a configuration of phase 7 in which the hard spheres typically 
penetrate each other's cores. As a consequence M^-y will be positive and infinite. A similar thing will happen for a simulation 
initiated in phase 7 in which the mapping of Eq. I2.14l is performed in the opposite direction. In this case Al-y-^ will be negative 
and infinite. In this sense Af^-^ acts as an order parameter for the system. This idea carries over into systems with continuous 
configurational energies. In this case for typical effective configurations v of phase 7, the mapping in Eq. 12. 141 induces a 
configuration r;^ (given by Eq. \2.6i of phase 7 of generally higher energy than the configuration r-y. In other words one will 
find that on average Mj^ > for typical configurations of phase 7. Likewise the opposite will be true (that is that M^^ < 0) 
for the typical configurations of phase 7. The resulting distributions are shown in figure lZ2l 

We are now in a position to define (in a way which is free of ambiguity) what exactly we mean by overlap. Suppose that 
H{M BA.iWj) denotes the histogram count for bin Mba.i for a simulation performed via tt'^. The estimator for P{Mba\t^'^) 
is given by: 

H(MBAt\0 

P{Mba,K)= ^, 7 . (2-17) 

Y.,=lH{MBA,t\'^^) 

We then define the concept of overlap as follows: 



The region of overlap is defined to be the set {MBA,i } over which H{MBA,i\'^'x) '^^^ iJ (Ms^^i |7r^) {or 
equivalently P{MBA.i\'^'A) '^^'^ P{MBA,i\T^'B) ) o.re simultaneously non-zero. 



The overlap between P{Mba\t^'a) and P{Mba\t^b) quantified 1461 by introducing the overlap parameter O: 



O . / dMBA 2i-(^^B.k^)P(M..|.|) 



O may be estimated by O: 



6 ^ (dMBA ^nMBA\.^)P{MBA\.%) 

J P{MbaU'a) + P{Mba\^%) 

In the case where the estimators P{Mba\''^'a) P{Mba\t^'b) overlap considerably, both O and O will assume a value close 
to unity. In the case where P(Mba\t^a) ™d P{Mba\t^'b) do not overlap at all O will assume a small (but non-zero) value, 
whilst = 0. 

We now proceed to derive the overlap identity 1621 . which relates the probabihties of obtaining a given value of Mba in 
the two phases. From Eq. I2.16l we see that: 
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Zb 

^ ZaJ_ 
Zb Za 

= ^e-^'BAp(^M*sAWA) (2.20) 



Rearranging this equation one obtains the overlap identity: 



^.A = ---"7^(^^^^l-^) (2.21) 
P{Mba\^%) 

This identity may be used to estimate via: 

e.b. e-^---^P{MBAA^'k) 

P{MbaM^%) 

where MBA,i is any bin for which one has a non-zero count for both simulations. 

The overlap identity imposes several constraints on the arbitrariness of the phase constrained distributions P(AfsA|7r5i) 
and P(AfByi|7r^). One such constraint is the value of Mba, which we label as Mm, at which point these two distributions 
intersect. Substituting P(Mm|7r^) = P(Mm|7r^) into Ea. l2.21l one obtains: 

^-BA = e-^"^ (2.23) 



= P^Fba (2.24) 

Therefore the FED of the phases manifests itself as an asymmetry of the point at which the two phase-constrained distributions 
intersect 1^ . 

Ea. l2.22l is very important because it links the idea of the overlapping of P{Mba\''^'a) ™d P{Mba\''^'b) to ones ability to 
estimate "JI-za- It is immediately clear that in order for Eq. 12. 221 to serve as an efficient estimator for '^'ba there must be some 
regions of Mba space over which P{Mba\''^'a) P{Mba\i^b) overlap. If this is not the case (as is the situation in figure 
I2.2t then either the numerator or the denominator of Eq. I2.22l will be zero for any bin Mba,!, yielding an incorrect estimate of 
'Jl'BA- It is in this way that the overlap problem prevents one from arriving at an estimate of '51% a which is free of systematic 
errors. Furthermore as the system size increases the extensivity of M^y^ results in the means and variances associated with 
the distributions P{Mba\''^'^) scaling in such a way so as to reduce the overlap of the two distributions (see fi sure l2!2l for an 
explanation). As a consequence the overlap problem worsens as the system size increases. 

In order to tackle the overlap problem, one needs to understand the factors which affect it. From our preceding discussion 
it is clear that the overlap is dependent on two factors: 
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Figure 2.2: Schematic for P{MBA\7r'x) and P(MBA|7r|j) 
This figure is a schematic for what one would typically expect for the distributions of Mb a for the two phases. For a simulation 
in phase A, the effective configurations v that will be sampled will be typical of tt^ and atypical of tt^. As a consequence one 
will find that for most configurations £b(v) > £a(v) (meaning that Mb A is positive), which will yield the right-hand side 
peak shown in the figure above. For a simulation initiated in phase B the opposite will be true and this time the value of Mb a 
will be, on average, negative, resulting in the left hand peak. In this way Mb a acts, in some sense, like the order parameter 
M described in section n".3.2.3l 

The overlap problem essentially amounts to the lack of overlap of the two peaks P{Mba\'^a) ^^'^ ^'I^saK^), and prevents 
one from obtaining an estimate of ^-ba (via Eq. I2.22> which is free of systematic errors. Since the energy is an extensive 
quantity (so as to make Mb a extensive) , the distance of the peaks from the origin will scale with N and the spread will scale 
as Vn. As a consequence the overlap will decrease (i.e. the overlap problem will get worse) as the system size increases. 
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• The choice of the translation vector D which maps configurations of one phase onto those of the other 

• The choice of representation v 1641 . 1651 . 

The first point has been addressed before 1341 and is briefly discussed in section lSJl The second point has not been investigated 
before and in chapter|3l we wiU show how one may choose a representation, based on the normal modes of the phase, which 
does, for structurally ordered phases, cure the overlap problem as the harmonic limit is approached. Before doing this we will 
(in the next section) present an array of techniques that have been developed in order to estimate quantities of the form of Eq. 



2.4 Review of methods 

2.4.1 Introduction 

Over the years a spectrum of methods have been developed in order to address the evaluation of Eq. 12.131 These methods 
include thermodynamic integration methods I23I - I25I . the canonical perturbation methods 1431 . 1441 . 1461 . 15 31 . the simulated 
tempering methods 1491 - 15 21 . the slow growth methods 1661 - 1681 . and the umbrella sampling and the multistage methods 
II45I . 1621 . 11691 - 17 II . More recent developments include the fast growth methods 1551 . 1721 - 1751 . and the phase switching 
method of Q). We will now review a selection of the methods available for estimating FEDs, limiting ourselves to NVT 
systems 1221 . Though these approaches do, at first sight, appear to be quite different, there are common themes that run 
through all the methods; we shall try to make them clear The interrelations between the methods can be most easily seen 
by observing the way in which the sampling distributions P{AIba\t^) are constructed and by expressing the estimators in 
terms of the macrostates Mb a- We point out that some of the insights offered in this chapter were also part of the work of 
this thesis; in particular the AI ba formulation of the simulated tempering (section l2.4.6> . the Mb a formulation of the phase 
switch method ('section l2.4.7> . and the path sampling formulation of the fast growth method (section l2.4.9l l. However for the 
sake of a coherent presentation of the ideas, we have put them in this section. 

2.4.2 Canonical Perturbation Methods 

The perturbation methods are probably the simplest and earliest methods developed to tackle the problem of determining 
FEDs I43I - I46I . We will now describe the two most well known examples. 

The first method may be most simply derived by integrating the overlap identity (Eq. I2.20t over all values of Mba, 
yielding: 




(2.25) 



or 
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^BA = J e-^'^^P{MBA\Tr'X)dMBA 



= < e-"""^ (2.26) 

Eq. 12.261 (see also Eq. I1.45> is what we refer to as the exponential perturbation (EP) method, ^-ba may then be simply 
estimated by: 

6 

3^3/1 =■ ^P(MsA.^kl)e-^^-^- (2.27) 



or: 



J?. Y.I=iH{MbaA^'a 



^^=^^^ — " (2.28) 



using the fact that P(Mb^. ^ |7r^) is given by Eq. 12.171 

Generally an attempt to estimate the FED via Eq. 12. 281 will fail. The reason for this ultimately stems from the fact that 
the regions which contribute the most to the numerator of Eq. I2.28l will be those regions over which P{Mba\''^b) most 
significant fTSi l. which, in the general case illustrated in figure I2T2I is not contained within the regions typically explored in a 
sampUng experiment performed with tt^. As a result systematic errors will be present in ones estimate of 'Rt.a- 

The second method is based on the identity: 



where A is the Metropolis acceptance function (Eq. ll.29t . Using this identity in Ea. l2.21l we get: 

'51-saA{~Mba)P[Mba\^'b) = A{Mba)P{Mba\t^a) (2-30) 
Integrating and rearranging this equation we find that: 

Eq. I2.31l is what is known as the acceptance ratio (AR) method |46|. Since A(M^-y) is the acceptance probability of a 
Monte Carlo move in which one attempts to switch the configurational energies from £^ to £^ whilst keeping the effective 



configuration v constant (we will refer to such a Monte Carlo move as a phase switch (PS), see section l2.4.7l i. we see that the 
AR method estimates iK^yi by evaluating the expectations of the acceptance probabilities for the phase switches in the two 
phases, without actually performing them as Monte Carlo moves. It follows from Eq. I2.31l that may then be estimated 
via the identity: 

e^. Y.\^^HMBAA'n\)A[MBAA ^2 32) 

Y.\=yP{^IbaMb)A{~MbaA 
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Since the AR method (Eq. I2.31> rests on the overlap identity (Eq. I2.21> . it follows that its estimator (Eq. I2.32> implicitly 
assumes that some sort of overlap exists between the estimators P{Mba\'^'a) ™d P{Mba\t^b)- ^'^^ immediately clear 
to what extent overlap is needed, and the insight into this shall be provided later in chapter|3 For the moment we shall remain 
content with the fact that in the general case an attempt to estimate ^-ba via Eq. I2.32l will fail due to the absence of overlap 
between the estimators of the phase-constrained distributions of the two phases (see figure l2!2t . 

Eq. 12.261 and Eq. 12.3 II are particular cases of a much more general formula originally derived by Bennett 1461 . This 
formula may be simply obtained by multiplying the left and right sides of the overlap identity (Eq. 12. 2H by an arbitrary 
function G{Mba) and integrating over all values of Mba- 

^BA j G{MBA)P{MBAWB)dMBA-- j G{MBA)e-^'^^ P{MBAWA)dMBA (2.33) 



^^■^ = 'Am W (2.34) 
For the choices of G{A'Iba) — 1 one obtains the EP formula (Eq. I2.26> and for G{Mba) — A{—M ba) one obtains the AR 
formula (Eq. 12.311 1. In general Eq. 12. 341 will require two separate simulations, one in each phase. For this reason it will be 
referred to as the dual phase (DP) formula. 

The perturbation methods described thus far (Eq. 12.261 Eq. 12.311 and Eq. I2.34> rest on simulations in which configurations 
are sampled according to their canonical probabilities (see Eq. I1.46> . The spectrum of methods that we will now review go 
beyond this and rest on the employment of sampling distributions which are different from the distributions with respect to 
which the expectations are performed (see Ea. ll.32t . 



2.4.3 Umbrella Sampling and Multicanonical Methods 

The underlying idea behind the umbrella sampling method of Torrie and Valleau 1451 . 1621 . 1691 - 117 II is that of sampling with 
a distribution tt™ which is different from the sampling distribution tt^ with respect to which the expectations are evaluated. 
That is by applying the identity of Eq. ll.32l to Eq. I2.26l thev obtain the following identity: 



(2.35) 



Torrie and Valleau considerably simplify the problem of constructing an alternative sampling distribution by noticing that the 
multidimensional quantity in Eq. I2.35l can in fact be solely expressed in terms of the statistics of AIba, provided that the 
ratio ttJ^/tt™ can be expressed as a function of AIba- By constructing an alternative sampling distribution tt^* by appeal to a 
weight function t]a{AIba)- 



P{MBA\7T7)^P{AlBA\7T'A)e-''-^^'--^ 



(2.36) 



CHAPTER 2. REVIEW 



47 



they use this idea to rewrite Eq. I2.35l as: 



^ „~Mba P(Mba\^'a) . 

^ (2.37) 



3?cByi may then be estimated via: 



BA 



(2.38) 



Eq. l2.38l also directly follows from Eq. I2.27l bv noticing that instead of P{MBA,i\Trj) being given by Eq. 12.171 it is now given 
by: 



P{MBA,i\^-y) = —h — KTTZ , ^ (2.39) 



f'(MB^,,k")e''^^'''^^-'^ 
E'=iWBA,,k:?)e''^(^^«^..) 

As we saw in section 12.4.21 the reason why the EP method (Eq. I2.26> fails to serve as a useful estimator is essentially due 

to the fact that P{AdBA\''^'A) does not contain P{Mba\t^b) II22J- The umbrella sampling method overcomes this problem by 

constructing a sampling distribution tt™ so that P{Mba\''^^) contains both P{Mba\''^'a) -P(-^^Byiks)- 

The construction of tt^ is however, just as difficult a task as that of finding the FED of the two phases 178 1. and in 
the original papers 1451 . 1621 . 1691 - 17 II no scheme was provided for constructing the sampling distribution tt™. Instead 
they acknowledged that the task of finding an appropriate tt™ was "tedious" and suggested performing several independent 
umbrella sampling simulations, with the umbrella distributions overlapping at the edges, in order to cover the desired region 
of (effective) configuration space. 

Recently the work of Torrie and Valleau has come alive again in the works of Berg and Neuhaus 1791 . 1801 in which um- 
brella sampling was reinvented under the name of the multicanonical (MUCA) algorithm 181 1 . This time, however, an efficient 
prescription for constructing the umbrella sampling distribution was provided. Within the context of umbrella sampling the 
MUCA algorithm may be thought of as the two fold process: 

• One constructs an estimate for P{Mba\''^'x), which we will denote by P{AdBA\''^'A)' o^^r ™ arbitrary range of A4ba 
space. 



One then samples from the MUCA sampling distribution: 



^J{v)=-. ^ (2.40) 

P(MsA(v)ki) 



via the acceptance probability of Eq. 11.251 
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It follows that the probability of observing a macrostate M ba under the MUCA sampling distribution tt^* is given by: 



P{M*baUJ) ^ I 5{Mba{^) - M*BA)n^{^) 
S{Mba{v) - M*ba) 



P{M*ba\ 



P{Mba{^)W^a) 

(2.41) 

PiM*B^\7:^;,) 

In the case where the estimate is perfect (that is P{Mba\t^a) — ^(^^saI'I'a) ) P{Mba\t^a) constant for all the range of 
Mba space. Therefore by obtaining a sufficiently accurate estimate of P{Mba\t^a) o^^^" '^^e desired range of Mba space, 
one may construct an umbrella (or MUCA) sampling distribution tt™ (via Eq. I2.40> which is flat (in Mba space) over that 
range. 

There are several methods of generating these MUCA weights and we will discuss the two simplest procedures. The 
first method (which is called the visited states method 1791 . 1821 ') starts off with an initial estimate in which all the MUCA 
weights rjA {MBA,i) are set to be equal to each other. One then performs a simulation (or iteration) for tmui Monte Carlo steps 
using the MUCA sampling distribution Eq. 12.401 The histogram H{MBA,i\T^A) ^^''■^ subsequently obtained is then used to 
improve the estimate of the MUCA weights through the identity EH, l82l-l84l: 

VA.j + liMBAa) = VA.jiMBAa) + HH{MbaA^a) + 1] (2-42) 

where {r]A.j{MBA.i)} denotes the weights of the current iteration, and {riA,j+i{MBA,i)} denotes the weights of the next 
iteration. Under this updating scheme macrostates which have been visited will have their weights increased, and as a result 
the probability (see Eq. I2.36t of visiting these macrostates is reduced for the next iteration. On the other hand the weights 
of macrostates which are not visited at all are left unaffected, so as to increase their chance of being visited (relative to those 
macrostates which have been visited) in the next iteration. By iterating this procedure, one may eventually obtain an accurate 
set of MUCA weights {r]A] over the desired range, which one may then use to construct a MUCA sampling distribution (Eq. 
I2.40t which is flat over an arbitrary span of Mba space. 

The second scheme for constructing the MUCA weights rjA is a modification of the Wang-Landau method I85I - I87I . In 
this method the time for each iteration corresponds to a single Monte Carlo step. That is one updates the weight after each 
Monte Carlo step (tmui = 1) via the update scheme; 

77Aj(MBA,i) + In/ : if bin MsA.i is visited 
■nA,j+i{MBA.i) ^ { (2.43) 

fiA.j{MBA,t) ■ otherwise 

The idea is to start of with an f greater than unity, and perform the update scheme of Eq. I2.43l until one obtains a flat histogram 
1881 . One then reduces f (but at the same time constraining it to be greater than unity) and repeats the simulation, this time 
starting off with the set of MUCA weights obtained at the end of the previous simulation. This procedure is carried out until f 
has reduced to a value close to unity, at which stage one will have obtained a sufficiently accurate set of MUCA weights so as 
to ensure that P{Mba\'^a) i^ range of Mba space containing both P{Mba\'^a) ^'^'^ P{Mba\t^b)- Using this 
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estimate one may perform a final simulation (in which the weights rj^ are unmodified) and use Eq. I2.38l in order to estimate 
^Ji-BA- In chapter |5l we will illustrate in greater detail the use of the umbrella sampling strategy (constructing the MUCA 
weights via the Wang-Landau method) in estimating the desired FEDs. 

2.4.4 Multistage Methods 

The multistage (MS) method may be considered to be a generalisation of the DP (dual phase) methods II53I . II54I . The central 
idea of this method 1451 is to break up the task of evaluating the FED into a series of independent tasks of estimating the FEDs 
between pairs of systems whose overlap is considerably improved with respect to the original pair. The method is based on 
the construction of a chain of configurational energies: 

{£a, =£a,£a.,£a3,...,£a„ =£5} (2.44) 
in which the configurational energy has the associated sampling distribution: 

7r;,^(v) = e^'^^^^^^) (2.45) 

By noticing that: 

^TiA = -^^...j (2.46) 

where 

Za, = / dve-''^^'(^) (2.47) 



and by constructing a chain of configurational energies in such a way that P{MBA\T^\i) and P{MBA\T^\i+i) sufficiently 
overlap, one may estimate each ^^-^ via any one of the techniques presented in this review. 

The standard (and simplest) way of consti'ucting £.\. is via the following Unear interpolation scheme: 



^A,(v) = A,£b(v) + (1-A0£a(v) 

= £aM+P'^\Mba (2.48) 

where Ai = < A2.... < A„ = 1 189]. As an example one may evaluate Ea. l2.46l via the EP method (EQ. I2.26t 1901 : 



n-l 

^'BA = n < e-'^I^^'+i^'^-^^'^")! >,,^ (2.49) 

i=l 

which, for the case of Eq. 12.481 may be written as: 

n-l 

^^A = l[< e-^^'*'^--^(^) >,,^ (2.50) 
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where 

AA, = A,+i-A, (2.51) 

Eq. I2.50l is interesting in its own right since it reduces to the well known method of thermodynamic integration I23I - I25I in 
the limit of sufficiently small {AAj}. To see this we note that for sufficiently small AA one may expand the exponential in 
Eg. 12. 501 as a power series in AA so that: 

n-1 

^Fba = -13'' InJi^ji = ln( J] < 1 - AX^Mba + 0{AX^) >^,J 

1=1 

n-1 

1=1 
n-1 

« < AA.MsA 

i=l 

« / dA<-^>,, (2.52) 
ja=o 

where in expanding the exponential of Eq. I2.50l we have neglected powers of order AA^ and higher. This is valid provided 
that there are a sufficient number of configurational energies Unking £ 4 to £5 (see Eq. I2.48t . so as to ensure that the AA^ 
are sufficiently small. We make a point to note that though the appearance of Eq. 12.521 is identical to that of Eq. 11.371 it 
has incorporated within (as do all the other methods that are being reviewed in this section) it the PM formalism of | IJ. This 
allows it to be used to directly estimate the FED between the two phases, rather than having to use it to evaluate the FED 
between each system and some reference system, as is the case in the original formulation (see Ea. ll.37t . 

2.4.5 Weighted Histogram Analysis Metliod 

The weighted histogram analysis method (also called WHAM I9T1 - I94I ) is a generalisation of the histogram re-weighting 
techniques of 1951 and 1961 . and employs a minimum variance estimator which uses the re-weighting of data from several 
independent simulations in order to calculate the FEDs. The original derivation is very involved and we follow the simpler 
derivation given in l54l . 

Suppose that one constructs a chain of configurational energies {£>; } linking £a to £5 (which we take to be the particular 
case of Eq. I2.48> . whose corresponding sampling distributions are denoted by n\. , Eq. 12.451 The WHAM method is based on 
the observation that if one performs several independent sampling experiments with the sampling distributions tta^, tta^, 
7rA„ , in which Nj independent samples are obtained for the sampling experiment performed with 7T\. , then the probability of 
observing a macrostate Mb a within the collection of data, as obtained from all the simulations, is given by: 

1 " 

P{Mba) = —y^N,P{MBA\7T^J (2.53) 

n 

Nt = Y.^^ (2.54) 

i=l 
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The underlying idea of the WHAM method is simple. The strategy is to construct a set of distributions {P{MBA\'^\i)} which 
overlap and connect the regions of (effective) configuration space associated with one phase to those of the other 1971 . With 
this choice the resulting P{Mba) will contain both P{Mba\'^a) '^"'^ ^'C-^^baItt^). To arrive at an expression which will 
allow one to estimate the partition functions {Z\. } (up to a multiplicative constant which is the same for all the estimates) one 
starts off by inverting the expression given in Eq. 12.531 



iVr ^ P{MBA\Trxj) 

= ^(^^^^K^.)^jV^|^,-(A.-A,)M.. (2.55) 

^ 7 = 1 ^ X i 



~7 P(AT W \ NTP{MBA)e-^^''-- 



Summing over all bins in Eq. 12. 5 61 and using Eq. I2.53l it is clear that Zx^ may be estimated by: 



^ c^. ^^ NTPiMBAM)e-^^^''^^-'' 

k=l ^1=1 

^ X1=^H{MbaA->^J^-'^''^^^\ 
k=l l^i=\ z^. ^ 

The set of equations given by Eq. l2.57l form the core of the WHAM method 119 81 . It is immediately clear from Eq. I2.57l that 
in order to estimate the ratio 'JI^ra = Z\^l Z\^ one must estimate the partition functions (up to a common constant) of all 
the sampling distributions associated with the configurational energies {£Ai} (see Eq. I2.44> linking £yi to £5. In order to 
estimate the {Z\.} one must solve Ea. \2.57\ iterativelv. One starts off with a set of estimates for {Zx^}, which one then feeds 
into Eq. 12. 571 to get a new set of estimates. One then feeds these estimates back into Eq. 12. 571 to get yet another set of even 
more accurate estimates. One carries out this iteration until the set of estimates have suitably converged. At this point the 
estimate of Zx^j Z\^ will yield an accurate estimate of '^•ba- The power of the method clearly lies in the enormous scope for 
parallelizability that exists in the construction of the path linking phase A to phase B. 

2.4.6 Simulated Tempering 

The simulated tempering method 1491 - 15 21 . like the multistage method, involves the construction of a chain of configurational 
energies linking Za to £3 (see Eq. 12.441 1. The basic idea behind the method is to simulate from a sampling distribution 
characterised by the following partition function: 

ZsT = ^ZA.e-^'sT = ^ / dve-''^^'(")-''sT (2.58) 

i=l i=l 
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where Z\. is the partition function associated with the sampling distribution ttx. (see Eq. 12.451 and Eq. I2.47t and rig)p are 
some weights. We will refer to each Z\. as a sub-ensemble. The idea of the method is to construct a Zst so that the 
corresponding sampling distribution explores (evenly) all the regions of (effective) configuration space 'containing' the two 
phase-constrained distributions P{Mba\t^a) ^'^'^ ^(^^baIttIj) and the regions in between them. One way t46i of reahsing 
Eq. I2.58l is to sample via the sampling distribution: 

n 

7r(v)==^e-''^^.(^)-''sT (2.59) 

An alternative sampling distribution (the one used in 1491 - 15 21 ) is one which 'hops' between the sub-ensembles and in which 
two types of Monte Carlo moves are employed. The first type of move involves the usual particle displacement. Such moves 
are accepted with the probability given by Eq. ll.25l where 7T\^ is used in place of tt if the simulation is in sub-ensemble i. The 
second type of move attempts to switch sub-ensembles. That is this move keeps the effective configuration v constant whilst 
trying to change the sampling distribution from tta^ to tta^^, (generally m' is chosen to be an adjacent sub-ensemble of m). 
In order to satisfy detailed balance (Eq. 11.191 1 and in order to yield a sampling distribution with a partition function given by 
Eq. 12.581 such a move must be accepted with probabihty: 

Pa{m^m'\nsT)^Mm{l, } (2.60) 

g-/5£A„(v)-?)J,j,' 

Accordingly we may write the sampling distribution as: 

7rsT(v,z)=e-''^^.(^)-''sT (2.61) 

where i is a stochastic variable assuming any integer value between (and inclusive of) 1 and n. Unless otherwise stated, we 
will assume hereafter that ttst corresponds to the sampling distribution given in Eq. 12.611 

Suppose that Pm denotes the probability of finding the simulation in sub-ensemble m and suppose that Tm denotes the 
time spent in the sub-ensemble m: 



T„, - ^A™(v(i)) (2.62) 



i=l 



where {v(l), v(2), v(i)} denotes the sequence of effective configurations generated by the simulation, and where: 

1 : if is a configuration generated in sub-ensemble m 
A™(v(^)) = <; (2.63) 
: otherwise 

Under the sampling distribution ttst, it follows from Eq. I2.58l that since the ratio of the probabiUties of the simulation being 
found in any two sub-ensembles is given by: 
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and since the ratio of the probabilities of finding the simulation in two sub-ensembles is estimated by the ratio of the times 
spent in the two sub-ensembles: 

then the ratio of the partition functions of and Z\^^ may be estimated via: 

— e'-^sT ^ST> (2.66) 

It follows from Eq. I2.66l that the quantity "Si's, a may then be estimated from the ratio of the times spent in the sub-ensembles 
1 and n: 

c.b. Tn ri^^^ ~n^^^ //-» 

^'RA ^ Tjre^^^ (2.67) 
J-i 

In order to arrive at an estimator in terms of the macrovariable M ba (as has been formulated in sections [2.4. 31 and l2. 4. 5l for 
the umbrella and WHAM methods respectively) we first note that: 



P{Mba\t^st 

i=l 
n 

= Y,P{MBA\7^^,).P{X^\7^sT) (2.68) 

1=1 

where P{Mba, ^iIt^st) denotes the joint distribution of observing the macrostate Mba and of being in sub-ensemble i under 
the sampling distribution ttst, and where P{Xi\TrsT) denotes the probability of being in sub-ensemble i under the sampling 
distribution ttst- From Eq. I2.58l and Eq. 12.6 H it follows that: 

nM'r^ST)= /' (2.69) 

By noting that in the case where the configurational energy is linearly parameterised fEa. 12.48V 

P{MbaVst) ^ P{Mba\7tx,) I , 

PiMBAlT^A) ^P{Mba\t^\o) 

~ n 

— ^Aq ^-\iMBA--rfKT (2.70) 



One may use Eq. l2.37l to arrive at an estimator for 3^syi in terms of the macrovariable Mba'- 



M„A P{Mba\^''a) 

^^■^ - , P(Mua\^^,) ^ 

^ P[Mba\itst) ^^st 

< 7^7 -T- >r 



(2.71) 
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The estimator in Ea. l2.71l is also valid if, instead of ttst, one employs the sampling distribution given in Eg. 12.591 

By hopping between the sub-ensembles the simulation is able to explore a wider region of (effective) configuration space 
than it would under a sampling experiment performed with any one of the individual sampling distributions tta^ . In order 
to ensure that the simulation visits all sub-ensembles, one must first ensure that a sufficient number of intermediate sub- 
ensembles have been constructed (so that P{AlBA\T^\i) overlaps with P(iV/B^|7rAi^i )). Secondly one must also ensure that 
the correct weights {rig)p} have been chosen so as to guarantee that the simulation is able to frequently traverse between 
the regions of (effective) configuration space typically associated with phase A and those typically associated with phase B. 
One way to do this is to choose the weights so that equal times are spent in all the sub-ensembles. In this case one sets 
''1st ~ 1^ + constant. However since a-priori the partition functions are not known, it follows that the weights must be 
constructed in an iterative fashion (e.g. via the visited states method or the Wang-Landau method) as is done in the Umbrella 
Sampling method (see section l2'.4.3l l. Having obtained the weights one may then proceed to estimate the ratio of the partition 
functions 'JI-ba by appeal to the estimator in Eq. l2.67l or Eq. 12.711 

2.4.7 Phase Switching Method 

The Phase Switching (PS) method, along with the whole phase mapping (PM) formalism, was originally developed in (l] (see 
(S) for a review and see I34I - I37I for generalisations). In order to motivate the method consider the case where one samples 
from a sampling distribution which is associated with the following partition function (which we refer to as the canonical PS 
partition function): 

Zps^ j rfve-'^^-^(^) + j dve-^^«(^) (2.72) 
An example of a sampling distribution associated with such a partition function is: 



^(V) = e-/3£A(v)^g-;3£B(v) 

= e-'^^-^(^)[l + e-*^«'*(^)] (2.73) 

From the discussion of section 12.4.61 it is clear that Eq. I2.72l mav be equivalently realised by what we call the canonical PS 
sampling distribution: 

7r|,5(v,7)=e-/'^-(^) (2.74) 

where 7 is a stochastic variable, which assumes one of two values: 7 = A or 7 = B. Eq. 12.741 is clearly a special case of 
the ST sampling distribution ttst in which one has only two sub-ensembles corresponding to the two phases and in which 
the weights {risr} are the same for both the phases. The sampling distribution n'pg then accepts a 'switch' between the two 
systems with the following probabiUty: 
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Min{l,e~*^«^} 

(2.75) 



The PS sampling distribution TTpg (and also the alternative sampling distribution in Eq. I2.73t then yield the following distri- 
bution: 

P{Mba\i^ps) = H H (2-76) 

ZiA + 

For the PS sampling distribution Upg the ratio of the partition functions is then given by: 

/dve-'5£«(-) 



■51 



< Ab(v) >^|.^ 

< Aa(v) >^c^ 

which may be estimated by the ratio of the times spent in the two phases (see Eq. I2.67> : 



(2.77) 



c.b. E»=iAb(v(z)) 



where A^(v) is given by: 



1 : if v(i) is a configuration is generated in phase 7 
A^(v(i)) = \ (2.79) 
: otherwise 



More generally (see Eq. I2.71> one may estimate 'JI'ba (for both Eq. I2.73l and Eq. I2.74> from the estimator corresponding to 



< UMba) >7r<^ 



where /(x) is the fermi function: 



1 + 

It is clear from Eq. I2.75l that a PS Monte Carlo move (in which one attempts to switch phases whilst keeping the effective 
configuration V constant) is only likely to be accepted within the |Mba| < 0(l)regions. From section l2.3l we saw that the vast 
majority of equilibrium configurations of the parent phase will be mapped (under the operation of the PM) onto configurations 
of the conjugate phase which are of higher excitation energy. Therefore the probability of visiting configurations for which 
the two phases are of roughly the same energy (under the operation of the PM) is negligibly small (see also figure l2!2t . Even 
if one considers the more general case of Eq. 12.5 81 where one introduces two weights into Eq. 12. 721 so that: 



(B) 

(2.82) 
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the problem will not be solved since, in the absence of overlap, these weights will at best allow the switching to take place only 
in one direction. As we saw in section l2'.4.6l the simulated tempering method solves this problem by constructing a series of 
intermediate systems so as to engineer overlap between adjacent systems. For such pairs of (sufficiently overlapping) systems 
the weights can be chosen so as to ensure that switching takes place frequently in both directions. 

The way the PS method overcomes the overlap problem is by using a set of weights which are themselves a function 
on (effective) configuration space. That is rather than simulating via a sampling distribution characterised by the partition 
function in Eq. l2.82l one instead employs a MUCA sampling distribution associated with the partition function: 

The sampling distribution (which we call the MUCA-PS sampling distribution) is then given by : 



_^ „-/3£^(v)-,)j,s(Msa(v)) 



(2.84) 



which has the associated Mba distribution: 



P{MBA\7T'Ps)^e-'T''^^-^'>PiMBA\7r'ps) (2.85) 
Eq. I2.83l mav be equivalently realised via the sampling distribution: 

7r(v)=e"^^-"(^'"''(*^«^'[l + e~*^^-"(^)] (2.86) 



It is immediately clear that the probability of a PS Monte Carlo move, as dictated by Eq. 12.841 is given by Eq. 12.751 since a 
PS does not change the value of M ba- 

Pa{A^B\n-s) = Min{l,^^j^} 
= Min{l,e~*^«^} 

(2.87) 

On the other hand the probability of accepting a move from a configuration v to a configuration v' is now given by: 



Pa(v^ v'Itt^s) = Min{L 



'r??s(v,7) 

-i3E^{v')~riPs{MBA{v')) 



= Min{l, — — ^ , } (2.88) 

Therefore the role of the MUCA weight function riPs{MBA) is not, as is the case in the ST method, to aid the simulation 
(directly) in switching between phases, but instead to guide the simulation to regions of (effective) configuration space from 
which the simulation can easily switch phases. 
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In order to determine JI-ba one applies the reweighting formula (Eq. I1.32l i to Eq. 12.771 (so as to remove the bias of the 
MUCA weights in Eg. lTSSt : 

The corresponding estimator is then given by: 



^■BA = t^r' ... ... :. . (2-90) 



Similarly the more general estimator given in Eq. 12.801 valid for both Eq. 12. 841 and Eq. 12.861 is now replaced by 



Figure 1231 (a) shows a schematic for P{Mba\t^ps) (b) shows a schematic for P{MBA\T^f^s)- clear from (a) that a 
canonical PS sampling distribution TTpg initiated in one of the two phases will remain stuck in that phase, since the probability 
of visiting the AIba ^ regions is negligibly small. In order to obtain the estimator of the canonical distribution shown 
in (a), one would first have to perform a MUCA simulation as shown in (b) and then reweight the data appropriately (see 
1991 for details). The essential feature of the MUCA distribution P{Mba\'^i^s) ^^^^ contains both the phase constrained 
distributions P{Mba\'^a) '^^'^ P(Mba\t^b)- Figure l2!4l shows a schematic of the PS procedure. 



2.4.8 Fast Growth Method 

The fast growth (FG) method [55 1 rests on a result called the Fluctuation Theorem. This theorem has been proved for a variety 
of non-equilibrium processes 1551 . 1721 - 1741 . I100I - I105I . The particular formulation that we will draw on is set out in 1551 . 
I72l-l74l. 

Central to the Fluctuation Theorem is a non-equilibrium process that we will now describe. One starts of by constructing 
a configurational energy £.\ which is a function of a field parameter A (for example see Eq. I2.48t . The field parameter A takes 
any value between Ai and A„, and the set {£a} forms a chain of configurational energies linking £yi to £b (see Eq. I2.44> . 
The A ^ B non-equilibrium process, which takes us from phase A to phase B, involves the switching of the field parameter 
A from an initial value of Ai =0 to A„ = 1 in a series of discrete steps at some predefined, but arbitrary, set of times {t} = 

{tl,t2, tn-l}- 

The implementation procedure may be realised as follows. The initial point v(l) is sampled with respect to the canonical 
distribution tta^ ~ tt^. One then increments (at time ti) the field parameter A from Ai to A2; in doing so one performs 
a (temperature scaled) amount of work (which we will simply refer to as work for the rest of this thesis) SWBA{ti) = 
(■^(1)) ~ ^Ai (v(l))} on the system. The system is then equilibrated via the sampling distribution ttx^ until a time t2 
yielding the configuration v(2). At this point one increments A from A2 to A3 so as to yield the work increment SWBA{t2) = 
/3{£a3 (v(2)) — (v(2))}. One continues this process until the value of the field parameter has reached its terminal value, 
A„, at time tn-i- 

As a result one obtains the path: 
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States for which ii switch of phases 
'h likely to be accepted 




P(M,.) 



H * [■ 



(b) 



M 



BA 



Figure 2.3: Schematic of probability distribution of Mba 

(a) The canonical distribution P{Mba\t^ps) 

(b) The multicanonical distribution P{MBA\T^iis) 

In order for the simulation to be able to reach those regions characterised by Mba ^ 0, one must employ MUCA weights 
(through Eq. ITPt . See also figure IZ4l 
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states from which a switch of 
phases is likely to be accepted 



V 




Figure 2.4: Schematic of tine PS algoritlnm in r space 
The diagram shows a simulation starting at a typical ('equilibrium') configuration of phase A . The employment of the weights 
{t]} in Eq. I2.84l means that the simulation performs a random walk (in Mba space) to the region B. For these configurations 
Mb A ^ and therefore a PS (via Eq. I2.87^ has a chance of being accepted. An accepted PS then takes the simulation to 
region C, which just corresponds to a switching of reference configurations (see Eq. 12. 6> . Then under the influence of the 
weights the algorithm performs a random walk in Mba space from region C to region D, which is the equilibrium region of 
(absolute) configuration space of phase B. The corresponding points A, B, C, and D are also shown in figure l23l 
The utility of the macrovariable Mba (as was mentioned in the discussion of Eq. I1.13> is that it serves as a guiding parameter 
for the simulation. In the case of the PS method, it is used to guide the simulation to regions of (absolute) configuration space 
from which a switch of phases has a non-negligible chance of being accepted. 
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t ti,t2, ■■.,tn-l 

V ^ v(l),v(2),...,v(n-l) = {v} 
X{t) Ai,A2,...,A„ (2.92) 

The net work for the A B process is then given by: 

n— 1 n—1 

Wba{M) = Y.^WBA{k) = /3^[£a.+i(v(z)) - £a.(v(z))] (2.93) 

i=l 1=1 

In the case where one employs the Hnear parameterisation as given in Eq. 12.481 Eq. I2.93l mav be written as: 

n-l 

Wba = ^AA,MBA(v(i)) (2.94) 

where: 

AA, - A,+i - A, (2.95) 

A non-equihbrium process taking the simulation from phase B to phase A may be similarly defined. For the sake of notational 
convenience it is instructive to think of the B ^ A process as an A ^ _B process, which is performed backwards in time 
(that is the sequence of events is reversed 1731 . 1748 ). and with the simple modification that the initial configuration, now 
corresponding to v{n — 1) at time is sampled from the distribution ttx^ = 7r|j. That is the path is constructed as follows. 
At time t„_i one decrements A from its initial value of A„ to A„_i, thus performing an amount of work SWab (tji-i) = 
/3[£a„_i (v(rt — 1)) — £a„ (v(rt — 1)) . One then proceeds to equilibrate the system so as to obtain the configuration v(7i — 
2) at the time t„_2, at which point A is further decremented from A„_i to A„_2, thus incurring a work SWab {tn-2) = 
/3[£a„_2 (v(n — 2)) — £a„_i (v(?t. — 2)). This procedure is repeated iteratively until time ti at which point A is decremented 
from A2 to Ai. It is clear from this that the (temperature reduced) work increment performed at ti, when A is changed from A^ 
to Ai+i, is given by: 



6Wab{U) = /3[£a. (v.) - £a.+i (v,)] (2.96) 



The net work for the resulting path: 



t 



V 



v(n-l),v(n-2),...,v(l) = {v} 
X{t) ^ A„,A„^i,...,Ai (2.97) 



is given by: 
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n-l 



WABiM) = J2^Wab{U) (2.98) 

n-1 

= /3^[£a.(v,)-£a.+,(v.)] 



n-1 



= -/3^[£,^^,(vO-£a.(v,)] 
1=1 

= -/3W^i3A({v}) (2.99) 

Suppose now that P{Wba\t^a) corresponds to the probability of obtaining Wba for the A ^ B process and suppose that 
-PCM^baK^) 11061 is the corresponding quantity for the B A process (hereafter whenever we mention tt!^ in the context 
of the FG method we will in fact be referring to the 7^7 process, in which the initial distribution of the configurations is 
given by tt^). The fluctuation theorem, which we have also proved in appendixIXl asserts that 1721 - 1741 : 

ZbP{Wba\'^'b) - ZAe-^-^P(W^BAkl) (2.100) 



_ e-^^-PiWBAWA) 

P{WbaUb) 

In the special case of zero equilibration (which is equivalent to changing A directly from Ai to A„ in a single step): 



WBA=5WBA{ti) = /3{£a„(v(1))-£a,(v(1))} 

= Mba(v(1)) (2.102) 

Eq. I2.101l reduces to the overlap identity Eq. 12.211 For this reason we will refer to all the methods based on the identity Eq. 
I2.21l as the zero equilibration, or elementary, methods. Since Eq. I2.101l is simply a generalisation of Eq. I2.21l we will also 
refer to this formula as the overlap identity. It immediately follows that we may generalise Eq. l2.26l to: 



e-^^^ >,c (2.103) 

and Eq. OTI to: 

< A{Wba) >-k- 

= ^ w \ ^ ^2.104) 

< A{-Wba) >7r- 

Eq. I2.34l mav be replaced by the more general 'dual-phase' (DP) formula II107I : 

< G{WBA)e-^"^ 

^53^ = InZ (2.105) 

< G{Wba) 

Generally P{Wba\''^'a) ™d P{Wba\''^'b) will also face an overlap problem in the sense described in section l23l However 
the FG method does have a way of getting around this; we will postpone our discussion of this point until chapter|5] 
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2.4.9 Path Sampling Fast Growth Methods 

All the simulations mentioned until now have been discussed in the context of the sampling of individual configurations v. 
Let us now consider a simulation which, instead of jumping between configurations, jumps between paths {v} (such as those 
produced in the FG method, section l^^.Sl l. Such a simulation can be though of as comprising of a two fold procedure. In the 
first stage the simulation generates a path {v}. In the second stage a decision is made whether to accept or reject the new path. 
The idea behind the path sampling formulation of the FG method 11081 is to express it in terms of the notion of the sampling 
of paths, in the way that has just been described. 

Suppose that CPf^^^({v}) denotes the underlying distribution of the paths produced in the 7^7 FG process as described 
in section l2'.4.8l It was proved in appendixIXIthat 1731 . 1741 : 



exp{WBAiM)} (2.106) 



^B^AiM) Za 

We will now show that the FG method described in section l2.4.8l mav be interpreted as a path sampling simulation in which 
paths are generated according to J*^^^ ({ v}) in the 7-^7 process, and subsequently accepted with probability 1 . To be more 
specific, suppose that the current state of the simulation is described by the path {v} and suppose that {v} corresponds to the 
path that has just been generated. Then it is clear that if one is to obtain a set of paths distributed according to J'^i^sli^}) 
then the acceptance probability of moving from the path {v} to {v} (in the A ^ B process) is given by (see Eq. I1.22> : 

^.(W - {v}k^) = Mm{l, p^({^}|{,})a,^^^({,})} (2.107) 
where Pg({v}|{v}) denotes the probability of the simulation generating a path {v} given that the previous path was {v}. In 
the procedure described in section l2'.4.8l the path {v} is constructed from the initial configuration v(l), which itself is obtained 
from v(l), the initial configuration of {v}, by equilibrating the system for a fixed amount of time via tt^. Using the notation 
of appendixIXlit is clear that Pg({v}|{v}) is given (for the A ^ B process) by: 

PcHniM) = Psi^il) -> v(l)|7ri)P^^B({v}|v(l)) (2.108) 

where P5(v(l) v(l)|7r^) denotes the probability of making a transition from the configuration v(l) to the configuration 
v(l) when sampling from tt^ for a fixed amount of time and where Pa^b({v}|v(1)) denotes the probability of obtaining a 
path {v} via the A ^ B FG process of section 12.4.81 given an initial configuration of v(l). Since from Eq. IA.3l we know 
that: 

F5(v(l) ^ v(l)k^) _ 7r^(v(l)) 

P5(V(1)->V(1)|^^) TTj^Kl)) 

it immediately follows that (for the A ^ B process): 
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Pg{{v}\M) ^ P5(v(l)^v(l)ki)P^^B({v}|v(l)) 

Pg({v}|{v}) Ps(v{1) v(l)|^^) P^^b({v}|v(1)) 

7r^(v(l))P^^B({v}|v(l)) 
7r:i(v(l))PA^s({v}|v(l)) 



(2.110) 



where we have use the fact that: 



n^BiM) = ^^(v(1))P^^b({v}|v(1)) (2.111) 

Eq. 12. 107 1 then becomes; 

^a({v} -> {v}|7rl) = 1 (2.112) 

Therefore we infer that the FG method described in section l2'.4.8l mav be thought of as a path sampling experiment in which 
the paths are generated according to the mechanism described in section l2'.4.8l and in which moves between old and new paths 
are accepted with probability 1. 

The benefit of the path sampling interpretation of the FG method is that it allows us to generalise the PS method (Eq. 
I2.74t so as to be applicable within the framework of the FG method. To see this we recall that the canonical PS sampling 
distribution iTpg (Eq. I2.74> realises the Mb a distribution given in Eq. 12.761 It is not hard to show that the path sampling 
distribution: 

5'ps(W,7)=^7J'7^7({^}) (2.113) 
in which 7 is a stochastically sampled variable, realises the following Wba distribution: 

P(W \ ZaP{Wba\'^'a) + ZbP{WbaU%) 

F{Wba\t^ps) = H — (2.114) 

Za + ^b 

where we have used P{Wba\t^ps) to denote P(VFb^ | CPpg) in order bring out the links with the zero equilibration cases. Eq. 
12.1 13l is essentially the path sampling generalisation of Eq. 12.741 The implementation of Eq. l2.113l involves the employment 
of an additional Monte Carlo move which allows one to switch between processes. That is suppose that C,a~*b labels the 
A ^ B FG process and Cb~>a labels the B ^ A FG process, in which paths are generated according to Eq. 12.1101 Then 
in this notation P{Wba\''^'^) = PIW^saICt^t)- As with the original FG method, the FG phase-switch (FG-PS) procedure 
involves (for the C7-»7 process) generating paths in the manner described by Eq. 12.1081 and subsequently accepting with 
probability 1 (see Eq. I2.112> . On top of this one introduces an additional Monte Carlo move in which one attempts to switch 
between the Ct^t- and Ct^t processes whilst leaving the path {v} unperturbed; the acceptance probability for such a move 
is given by: 
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= Min{l,e"^^'^} (2.115) 

where we have appealed to Eg. lA.lOl In the case where P(VFsAk^) and P(W^Byi|7r^) partially overlap about the Wba 
regions, the FG-PS method allows one to sample all the paths that contribute non-negligibly to the estimator of the FED. One 
may then proceed to estimate ^-ba via: 

< H-Wba) >-n-p^ 

2.4.10 Looking Forward 

We are now in a position to explain more fully the scope of the study presented here. We have seen that the key obstacle to 
the problem of determining FEDs is the overlap problem. To deal with this problem one must: 

1 . Choose an appropriate global displacement D in order to map configurations of one phase onto the other (see Eq. 11.431 
figure n~3t and choose a representation v (see Eq. 12. 5> 

2. Choose an estimator |46| . 

3. Employ some form of extended sampling strategy 1211 . 

In succeeding chapters we take up each of these points in turn. In chapter|3lwe will deal with the first point and will show 
how the overlap depends on the representation one chooses to work in. Specifically by working in a representation in which 
the effective configuration v corresponds to the normal modes of a crystalline solid we construct a transformation (called the 
fourier space mapping, FSM) which, under certain conditions, is more efficient than the RSM (see Eq. 12. 4t . 

In chapterl^we provide some insight into the second point and show how in the case of partial overlap, which is when 
there are Wba macrostates over which both the estimators P{Wba\''^'x) and P{Wba\i^%) are non-zero, and when there are 
other macrostates for which only one of the estimators is non-zero, then the estimator of Eq. 12.1051 can be guaranteed to 
work (in the sense that the estimate of iK^yi is free of systematic errors) for any G{Mba) simply by restricting the regions 
of Mb A space from which the non-negligible contributions to the expectations come. Furthermore we will also show that 
there is a family of estimators of the form of Eq. I2.105l for which no such restrictions are required since by construction these 
estimators are guaranteed to be free of systematic errors, provided that there exists some overlap between P{Wba\'^a) 
P{Wba\'^%)- 

The third point is addressed in chapter|5l We start off by applying the basic theoretical techniques of umbrella sampling 
1211 . 1621 . 1791 to the problem of phase behaviour, and use the recently developed Wang-Landau 1851 technique to construct 



the MUCA weight function needed in order to estimate the FED (see section 12.4. 3t . In addition to this we present a new 
method of overcoming the overlap problem (called the Multihamiltonian (MH) method) which involves simulating several 
independent samphng distributions simultaneously. Like the WHAM method, the benefit of this method is that it is highly 
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parallelizable. We then proceed to make an investigation of the FG method, and show how it is able to effectively overcome 
the overlap problem. 

Having formulated the FED problem in a strictly classical framework we proceed in chapter|6lto consider the quantum 
formulation as provided by the path integral formalism. The quantum FED problem is even more computationally intensive 
than its classical counterpart, and we show that the MH method developed in chapter|5lprovides an efficient way of estimating 
the relevant quantities. In particular we illustrate its power with a study of the role of zero-point motion in determining crystal 
stability. 



Chapter 



3 



Tuning 



the Representations 



3.1 Introduction 



We saw in sections \L2\ and [T . 3 . 21 that different phases of a given material may be thought of as corresponding to different 
basins of attraction of the configurational energy E{r). For finite-system-constructs of the relevant phases, there may exist 
several metastable basins of attractions (corresponding to the different phases) and it is ones desire to find the most probable 
one. In the thermodynamic limit this basin of attraction becomes overwhelmingly more probable than the others, resulting in 
the corresponding phase being the one that is found in nature. In this thesis we will focus on the case where there are two 
candidate phases, and our task will be limited to that of determining the more stable out of the two |41 1. 

Computationally the task of finding the more probable out of the two phases involves implementing a Metropolis simula- 
tion in which one visits these two regions in a single simulation. One may then estimate the FED via Eq. II. 81 and Eq. 11.151 
However the sequential or pathwise nature of the Metropohs method and the presence of a region of (absolute) configuration 
space, in between the two phases, of intrinsically low probabihty means that a simulation initiated in a given phase will remain 
trapped in that phase. 

If instead of working in the r representation (in which one attempts to estimate the quantity Rba in Eq. II. 8> one chooses 
to work in the v representation (in which one attempts to estimate the quantity JI-ba in Eq. I2.13> . one greatly alleviates the 
difficulty associated with the problem of estimating the FED of the two phases by bypassing this intermediate region altogether 
(compare figure lOI (a) and (b)). The residual difficulty that is left in the associated problem is captured in the amount by 
which the two phases overlap in v space, and it is this difficulty which must be overcome in order to estimate "JI-ba- 

In order to gauge the amount of overlap between these two regions, one must (by virtue of the overlap identity Eq. I2.21> 
observe the amount of overlap that is present between P{Mba\tta) ^^'^ P{Mba\t^b)- This overlap in Mba space is clearly 
dependent upon two factors: 

A The choice of the reference configurations (see Eq. 12. 6> . Different choices of and R^ correspond to different 
choices of D in Eq. I2.3l (see also Eq. 11.431 Eq. 11.451 figure FTjl ). 

B The choice of representation v l64l . l65l . which Mba(v) depends on (Eq. 12.151 1. 
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The ideal choices of the global translation D and the representation v are ones for which P{Mba\'^a) ^(-^baI'I's) (see 
figure 12721 collapse onto the distribution: 

P{M^y\TT^) ^ S{M^-y + \nR^^) (3.1) 

Departures from this ideal limit manifest in the bi-modality of P{M;yj\Tr!!^), as shown in figure IZ2I The more efficient the 
choice of D and representation v, the closer the distributions P{Mba\'^a) ^^"1 ^'IA^baKb) will be to the ideal limit (Eq. 

The general challenge to the problem of estimating the FED is that of tuning the PM so as to maximise the overlap 
between the two regions of effective configuration space (see figure im fb)). In this chapter we will investigate this issue in 
the particular context of the two phases being crystalline solids. We will primarily focus on the role of the representation 
(issue B). Specifically we will see that a PM as formulated in a Fourier space (normal mode) representation provides some 
strategic advantages over the real-space representations (RSM, Eq. \2.4l utilised in previous studies (Q, 1341 - 13 81 . 

In order to set the context we will now introduce the model system (the Lennard Jones solid) which has been employed in 
all the simulations carried out in this work. This will be followed by a section illustrating how the PM is implemented for our 
model systems, followed by a brief discussion of the role of the global translation vector (issue A) in the mappings between 
the two phases. 

3.2 The model system 

For the work in this thesis our model system will be comprised of particles interacting via the pairwise Lennard- Jones (LJ) 
interatomic potential: 

0(r,,) = 4e[(-)i2 -(-)«] (3.2) 

where 4>{rij) corresponds to the interaction energy between particles i and j separated by a distance — jf^; — rj\, and where 
Ti and Tj are the position vectors of particles i and j respectively. The overall configurational energy -E'(r) is then given by: 

N N 

Generally the use of the full configurational energy in Eq. 13.31 in a simulation is prohibitively expensive, and one instead 
employs some form of approximation whereby the potential is truncated at some distance from the particle fT091. The ensuing 
phase diagram (a schematic of which is shown in figure ITlT l is highly sensitive to this truncation radius; the latter has to be 
chosen carefully if one is to reproduce the true characteristics of the phase under consideration. By analysing the fluctuations 
in the ground state energies and the harmonic free energy differences as a function of the truncation radius, it was found in 
II 101 (where identical systems were employed to those used in this work) that a truncation radius Tc given by: 



(3.4) 
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Figure 3.1 : Schematic of classical phase diagram for the LJ potential 

The figure shows a (scaled temperature versus scaled pressure) schematic for the classical phase diagram of the LJ potential 
in Eq. 13.21 In order to determine the hep, fee phase boundary, one must determine the more probable of the two phases. 
The methods utiUsed and developed in this thesis are able to address this sort of problem. The methods can, without much 
difficulty, be generalised to the case of the solid-liquid boundary. An initial line of investigation into this has been made in 
lEH, El. (Seealsof22|). 



yielded sufficiently accurate results (where r„„ is the nearest neighbour distance). A truncation radius of this magnitude 
essentially amounts to each particle interacting with both its first nearest neighbour shell (comprising of 12 particles) and 
its second nearest neighbour shell (comprising of 6 particles) 111 1 II . This truncation radius was also employed for all the 
simulations used in this thesis, with the exception of those in chapter|6](in which a truncation radius of Vc = l.l?'nn was 
employed). 

Unless otherwise stated, the system size that we have employed is = 216, and the densities are pa^ = 1.092. We will 
also quote all results in terms of the reduced temperature: 

T* = — (3.5) 

e 

We also make a point here that it is our intention, in this thesis, only to use the LJ system as a testbed for the various methods 
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and not to give definitive results for the LJ phase diagram (which has already been done in II 101 ). 

3.3 The Phase Mapping for crystalline solids 

Let us now focus out attention on the regions of the phase diagram near the fcc-hcp boundary (see figure lTTl . and let phase 
label A refer to the fee structure and phase label B to the hep structure. In the case of crystalline solids, the reference 
configuration (R^) may most conveniently be chosen to be the ground state configuration (i.e. the lattice sites themselves). 
The full PM then involves a switch of lattice vectors, accompanied by the mapping of the displacements (of the particles from 
their lattice sites) of one phase onto the (possibly modified) displacements of particles of the other phase. In the case of the 
RSM, these displacements are unmodified on the transition of the phases. 

For the fee and hep structures, one may identify families of planes which are common to the reference configurations of 
both the structures. Whereas the fee structure may be thought of as being comprised of three families of close packed planes 
(see figure ll!2l (a)). the hep structure may be thought of as comprising of two layers of close packed planes (see figure|^](b)). 
The geometry of the planes are such that they permit a simple mapping of the lattice structure of one phase onto that of the 
other (see figure l33l . In this case the operation of the RSM takes a particularly simple form; one merely slips the planes as 
shown in figure 1331 whilst at the same time preserving the relative positions of the particles within a given plane. For a more 
detailed illustration of these planes and their corresponding structures, we refer the reader to II 101 and II 121 

The choice that we have made for mapping the lattice sites of one structure onto those of the other does not exhaust the 
possibilities for D. In fact for a given labelUng scheme of the particles, one may choose to map the lattice vector of particle 
i of structure 7 onto that of particle j of structure 7, instead of mapping it onto the same particle of the corresponding phase 
111 131 . This procedure may equivalently be thought of as a permutation of the index labelling the particles under the operation 
of the PM; there are of the order of N! such permutations. This point was investigated to a limited extent for the RSM in the 
case of hard spheres by Jackson et. al. 1 35 1. In their work they investigated cases where the planes were displaced a distance 
greater than that shown in figure l33l in transforming from one phase to the other. They also investigated the cases where the 
planes of the fee structure were randomly stacked when forming the hep structure, and also the case where the displacements 
of a particle of the fee structure were mapped onto those of a randomly chosen particle of the hep structure. All the alternative 
mappings resulted in greater number of hard spheres overlapping, as compared to the mapping presented in figure l33l In this 
work we do not investigate this issue any further 1 1 14|. 

In the case of the solid-liquid phase boundary, an appropriate reference configuration for the liquid phase is simply any 
typical configuration of the fluid phase 1361 . Though we will not have anything more to say about this, we note that an 
investigation along this direction has been made (for the hard sphere case) in P361 and more recently (for the soft potential 
case) in 1371 . 
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(a) FCC Structure (b) HCP Structure 

Plane Ai O O O O Plane Ai O O O O 

Plane B, © © © © pi^, g, © © © © 

Plane C| • • • • Plane A, O O O O 

Plane A2 O O O O Plane Bi © © © © 

PlaneB, © © © © pi^ne A, O O O O 

Plane C2 • • • • PlaneB, © © © © 



Figure 3.2: The fee & hep struetures 

The fee strueture may be thought of as eomprising of three families of close packed planes, labelled as A, B, and C in figure 
(a). 

The hep structure, on the other hand, can be represented in terms of the two families of close packed planes (stacked in the 
ABAB... formation, see figure (b)) 
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FCC Structure 

Plane R Plane S 
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HCP Structure 



Plane Ci 



Plane At 



Plane B2 



Plane C, 



© ► 



I O 
© © 

■4 • -4 • 

I ■* O 

© ► © 



o o 



© © 



Figure 3.3: The PM Transformation 

The figure showing the fee stmeture on the left (characterised by 3 different planes, stacked in a 'ABCABC...' formation) 
being transformed to the hep structure (characterised by 2 planes, stacked in the formation 'ABAB...'. See also Figure IT2i . 
The atoms of plane Ci are made to lie beneath those of plane Ai, the atoms of plane A2 underneath those of Bi, the atoms 
of B2 under those of Ai, and the atoms of C2 underneath those of plane Bi. Note that periodic boundary conditions apply. A 
consequence of these boundary conditions is that one could, in principle, slide the planes a greater distance than is shown, so 
as to transform one phase into the other However we have illustrated only one possibility, the one which is actually used in 
the simulations. See f351 and fl 101 for more details on the other possibilities. 
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3.4 Formulation of the Fourier Space Mapping 



In section 13711 we mentioned that there are two issues at hand, the first being the choice of a suitable reference configuration 
and the second being the choice of an appropriate representation v. Given that the lattice sites themselves serve as both 
natural and convenient choices for the reference configurations, we will now concentrate our efforts on finding an optimal 
representation v II 151 . Specifically we will choose a representation in which v corresponds to the normal modes (whose 
corresponding mapping Eq. 12.141 we call the Fourier Space Mapping, FSM), and compare the overlap that one obtains in 
this case to that of the RSM (Eq. I2.4> . We will show that in the harmonic limit the FSM ensures that the two phases have 
identical excitation energies, so as to ensure perfect overlap (in the sense of Eq. 13. U between the two phases in the effective 
configuration space as parameterised by the coordinates v. In contrast to this we will also show that for the RSM this overlap 
wiU never be perfect 11161 . 

3.4.1 Constructing the transformation 

To motivate the transformation, consider the Taylor expansion of the (excitation) configurational energy (Eq. 12. 9> in powers 
of the displacements u with respect to the reference configuration II 171 : 

£.^(u) = E(R^ + u) - £;(R^) = £'^{u) + £;(u) (3.6) 

where the second term £!j denotes the harmonic contributions (containing powers of second order in the displacement u) and 
£" denotes the anharmonic contributions (of at least third order in the displacement u). The harmonic contributions may be 
written as: 

£i;(u) = iu^K^u (3.7) 
where is the 3N x 3-/V dynamical matrix. If K'^ denotes the entry in the i-th row and j-th column of the matrix then: 



^ dE^d^ ^^ ^^ 



Since K-^ is a symmetric matrix (i.e. it is Hermitian) we may (via the Gram-Schmidt orthogonalisation procedure if necessary) 
construct a set of orthonormal vectors {e^} which are the eigenvalue of K^. In our case we will take e^^ to be the 3N column 
vector corresponding to the j-th eigenvector of K^. If we set (Eq. 12. 5> such that: 



e 



T!/' = —7= (3.9) 



or: 



T, = (^ei , ^e^, ) (3.10) 
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where el^ is the i-th component of the j-th eigenvector of and where fc* is the eigenvector of e* then from Eq. 12.51 




(3.11) 



The summation in Eq. 13.1 H is performed over the 3N components {vm} (which we refer to as the fourier coordinates) of the 
column vector v. Substituting Eq. 13.1 ll into Eq. l3.7l one finds that: 



EEE 



t m n 



EE 

m 

E 



(3.12) 



where we have invoked the orthonormality of the eigenvectors: 



E 



in _ r 
7 7 — " 



(3.13) 



In other words, by choosing an appropriate representation v (which we call the fourier representation) in which the normal 
modes (or fourier coordi 
contains no phase labels 



modes (or fourier coordinates) of one crystalline solid are mapped onto those of the other, one may cast £!j: into a form which 



(3.14) 



Using the fact that [T^ 



kl^ei^ or 



V 



(3.15) 



we may use Eq. l2.12l to write down the matrix elements of S^a: 



CU' _ V A irn jni 

m V -D 



(3.16) 



This matrix has the associated determinant: 



(3.17) 
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The expression in Eq. 12. 101 for the FED may then be written as: 

AFba = AE%a + AF'^a + AF^a (3.18) 

where 

1 1 

- ^^IndetSBA = 2^EM^) (3.19) 

is the harmonic contribution to the overall FED (Eq. I3.18> . Since the harmonic contributions to the excitation energy (Eq. 13.71 
I3.14t are equal if they share the same fourier coordinates, we see that only the anharmonic contributions to the energy will 
survive in the evaluation of Mba- 



A/ba(v) - /3[£b(v) - £a(v)] 

= /3[£|(v) - £^(v)] (3.20) 



As a result the third term in Eq. 13.18 



Ai^sA = -/3"'ln3?BA (3.21) 

reflects the purely anharmonic contributions to the FED. 

One may realise the associated mapping, which we call the fourier space mapping (FSM), within the framework of the 
u representation through the operation in Eq. 12. 141 where S^-^ is given by Eq. ESI Fi sure 1341 shows a schematic of the 
conceptual procedure involved in the mapping. 

For systems with periodic boundary conditions, additional considerations must be taken into account in constructing the 
FSM. In appendixiBlwe outline the necessary modifications which must be incorporated into the transformation Sba in order 
to accommodate these constraints. Generally what one finds is that the use of periodic boundary conditions means that three 
of the eigenvectors of the dynamical matrix K-y (Ea. l3.8> will have zero eigenvalues. Suppose that el^, e^, and ei^ correspond 
to these null eigenvectors. The findings of appendixiBlare that one may simply omit these coordinates in the evaluation of the 
relevant quantities. For example the displacements u are now given by: 

"=E-"7pf (3.22) 
and the transformation matrix S ba (Eq. I3.16> is now replaced by: 

k 




^%a-T.\W.^'b< (3.23) 

7n—4 V -D 



We will assume that all subsequent summations over the fourier coordinates {vm} will be of the form of that employed in Eq. 
13 .221 and Eg. 13.231 i.e. summations which exclude the null modes. 
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IDENTITY 
TRANSFORMATION 



Yb =Ya 



Figure 3.4: Diagrammatic Representation of the FSIVI 

In order to transform (within the harmonic transformation) the configuration (of phase A) onto a configuration (of 
phase B) so that both configurations are excited above their respective ground states by the same amount (see Eq. |^}, 
we first transform from to v^, which represents the configuration in what we call fourier space. We do this using the 
transformation [T^]^^, given by Eauation l3.15l We then force phase B to share the same set of fourier coordinates as phase 
A by setting equal to va- We do this via the identity transformation. This ensures (by appeal to Eq. I3.14^ that (within the 
harmonic approximation) the two phases have the same excitation energies above their respective ground states. Finally we 
transform back to real space via the transformation [T^]. The net transformation is given by S^^ 111 181 . 
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3.4.2 Summary 

Summarising, we have constructed a transformation S-y^ (Eq. I3.16t called the fourier space mapping (FSM) which maps a 
configuration r-^ of phase 7 onto a configuration r-^ of phase 7 so as to ensure that the two phases are of identical excitation 
energies in the harmonic limit, thus guaranteeing perfect overlap. As the harmonic contributions to the FED are already known 
exactly via Eq. 13.191 the utility of this transformation will not lie in the overwhelmingly harmonic regime but will instead he 
within the anharmonic regime. In particular, since the overlap can be arbitrarily improved simply by reducing the temperature 
(O 1 as T — > 0), one might expect that the problem of estimating the anharmonic contributions to the FED might become 
controllably small in the T ^ limit. We will see that this is not quite the case. 

Before discussing the limitations of the FSM we will first focus on the efficiency with which it overcomes the overlap 
problem, and we will use the RSM for comparison. In order to compare the efficiency with which the FSM and RSM tackle 
the overlap problem, we will, in the next section, investigate the issue of the overlap between the two phase-constrained 
distributions of Mba via analytic techniques. Specifically we will show how, in the harmonic limit, the overlap problem 
vanishes for the FSM whereas it tends to a constant value for the RSM (in the sense that P{Mba\t^'^) assumes a stationary 
form), thereby serving as the most extreme illustration of the dependence of the overlap problem on the representation. We 
will also use these results to outline some of the basic limitations that the FSM faces in estimating the anharmonic FEDs 



Since a probability distribution is completely characterised by its cumulants, one way to obtain insights into the behaviour of 
the overlap problem is to focus on the cumulants of P{Mba\t^^)- Of particular importance are the first two cumulants, since 
it is these which correspond to the mean and variance of the distribution, and since it is these which will be most important in 
indicating the amount of overlap that will be present II 191 . To define the cumulants let us expand 31^^ (using Eq. I2.26> as the 



3.5 Analytic results 



exponential of a power series in M^j II12QI : 



00 



(3.24) 



where a;„ is the n-th cumulant. The first three cumulants are then given by: 



UJl =< Mjj >7r= 



(3.25) 



and 



UJ2=< M?^ >^c 



(3.26) 



and: 



= 2 < M^y^ >la -3 < M;yy >^e< M?^ >^c + < >^c 



(3.27) 
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The FED (Eq. I2.10> may be cast into a simple form using these cumulants: 



afba = as; 




(3.28) 



In the next two sections we will analyse the behaviour of these cumulants for the RSM and the FSM in the harmonic limit 

(T ^ 0). 

3.5.1 Fourier Space Mapping 

Drawing on anharmonic perturbation theory one may expand the configurational energy £y (v) as a power series in v in which 
the contributions of successive orders become increasingly smaller As the harmonic limit is approached one may discard 
all but the terms which scale, upon integration, with lowest order of T, thereby considerably simplifying the analysis of the 
cumulants. The results of the theory (appendix |Cj may be summarised as follows. In the limit T ^ (or /3 ^ oo), the 
cumulants of P{AI^-y\7T!r) scale in the following way with temperature: 



(02 : if n is even 
[1+11 (3.29) 
(3-^ : if n is odd 

From Eq. Il28l and Eq. Il29l one observes that in the limit T ^ one may write a cumulant approximation expression for the 
anharmonic contributions to the FED as: 



The behaviour of the FSM in the low temperature regime is in sharp contrast to the RSM in which all the temperature-scaled 
cumulants tend to a constant value, indicating that the overlap of the phase-constrained distributions of Mb A assume a constant 
value in this limit. To see this we once again appeal to anharmonic perturbation theory and expand the configurational energy 
as a power series of in v. The results have been worked out in appendix|D]and may be summarised as follows: 



lim AF; 



(3.30) 



3.5.2 Real Space Mapping 




(3.31) 



and: 




(3.32) 



where {k} are the eigenvalues of the matrix with elements: 




T 



T 



(3.33) 
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More generally one may conclude that: 

lim ujn 0(1) (3.34) 

so that all the cumulants contribute to the FED (Eq. I3.28> at arbitrarily low T. The fact that the temperature-reduced cumulants 
(Eq. I3.34t tend to a constant value in the harmonic limit translates to the fact that the overlap between P{AIba\t^a) ^'^'^ 
P(MB^|7r^) tends to a constant amount in this limit 11211 . 

3.6 Some numerical results 

In section (in section l3'.6.H we will start by investigating (numerically) the overlap of the phase-constrained distributions for 
both the FSM and the RSM in the low temperature and high temperature regimes 11221 . We will find that at sufficiently low 
temperatures the overlap associated with the FSM is, as expected, better than the RSM. However for higher temperatures it is 
the RSM which has the better overlap. In section l3.6.2l we will then proceed to estimate the anharmonic FEDs 11231 in the 
low and high temperature regimes. We will find that for sufficiently low temperatures, the FSM does not require any extended 
sampling in order to arrive at an estimate of the FED which is free of systematic errors. This is in contrast to the RSM, which 
will, in the most general case, require extended sampling. On the transition to higher temperatures we find that both the 
RSM and the FSM require extended sampling in order to overcome the overlap problem. We will then end this section with 
discussion of the relative efficiencies of the two methods. 

3.6.1 Overlap 

To start off with, let us consider the behaviour of the overlap problem for the FSM and the RSM in the low temperature 
regime. In considering the issue of the overlap, we recall (see section l23l that the difference in the free energies of two 
phases manifests itself as an asymmetry (about the origin) in the position at which the two phase-constrained distributions 
P{Mab\'^a) PiMABlT^B) intersect. For systems which have a small FED (as is the case for the systems employed 
here), this asymmetry will be ever so slight. An illustration of this for the RSM is shown in figure 1331 In characterising the 
overlap, the observed (approximate) symmetry allows us to focus our attention on the behaviour on a single phase-constrained 
distribution. The amount of overlap (which can, in an approximate way, be measured by the peak to peak distance) may 
then be gauged by measuring the distance of the peak of this phase-constrained distribution from the origin. The smaller this 
distance is, the greater will be the overlap between the two distributions 11241 . 

Figure shows the scaling of the first two cumulants of P{Mab\'^b) "^^^^ temperature. In accordance with Eq. 13.291 
hnear scahng is observed for the FSM. Moreover we see from figure |^](b) that the corresponding cumulants for the RSM 
tend to the limiting values as predicted by Eq. 13.3 H and Eq. 13.321 Figure lTTl investigates the overlap for a range of temperature 
by looking at the phase-constrained distribution P{AIab\'^b)- immediately observes that as the temperature is reduced, 
the overlap of the FSM becomes considerably better than that of the RSM. Whereas in the case of the RSM P(Af^B|7r^) 
tends to a limiting (stationary) form (see figures l3^ (a) and (b)) 11251 . in the case of FSM the corresponding distribution tends 
to the ideal Umit of the delta function (Eq. 13. U . which in this case is centred on the origin 11261 . 
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Figure 3.5: P{MAB\n%) and P{MAB\n%) for the RSM 
The phase-constrained probability distributions P{Mab\'^a) '^^'^ ^'(-A/yis|7r^), as obtained for the RSM. An approximate 
symmetry is exhibited: each distribution is a mirror reflection of the other about the origin. A similar symmetry is also 
observed for the FSM. 
T* =0.8 (see Eq.|33J. 



To understand the low-temperature behaviour of the FSM we note that the method only probes the anharmonic effects 
(see Eq. I3.20> since by construction the harmonic contributions to the configurational energy cancel out in the two phases. 
Since these anharmonic contributions vanish as the harmonic limit is approached, the observed behaviour is in accordance 
with what is expected. The RSM, on the other hand, does not 'fold out' the harmonic contributions. For this representation 
Ji-BA must assume a constant value {JI-ba — det S ba) in the harmonic limit. From the overlap identity (Eq. I2.21> we see that 
for this to be the case the ratio P(Af^B |7r^)/P(A/AS I'l's) must approach a stationary value in this limit. One way for this to 
be achieved is for both P{MAB\'n'A) ™d P(il/As |7r^) to tend to stationary non-singular distributions. This is precisely what 
is observed. 

So far we have analysed the behaviour of the FSM in the limit T ^ limit. Let us now discuss the behaviour of the 
transformation as the temperature is raised ( so as to make the anharmonic effects more prominent). From figure IT71 (c) and 
(d) we see that, though at low temperatures the overlap of the FSM is better than that of the RSM, at high temperatures the 
situation is reversed; the overlap of the RSM is better than that of the FSM. This can be understood by first noting that the 
FSM is a global transformation. That is, whereas for the RSM a single particle perturbation in phase 7 corresponds to a single 
particle perturbation in phase 7 (Eq. 12. 4> . in the case of the FSM a single particle perturbation of phase 7 manifests itself as 
a global perturbation in which all the particles of 7 are perturbed (Eq. 12.141 Eq. I3.16t . Therefore the anharmonic corrections 
to the energy induced by the exploration of a particle of phase 7 into the anharmonic regions of the configurational energy 
£^(v) will, under the operation of the FSM, propagate on a global level in phase 7. In contrast these anharmonic effects 
only propagate on a local level under the operation of the RSM. As a result one finds that in the highly anharmonic regimes 
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Figure 3.6: Temperature-dependence of the cumulants for the FSM and the RSM 

a) shows the cumulants wi and uj2 for the FSM for low temperatures. These cumulants are given by Eq. I3.25l and Eq. 13.261 

b) shows the scaled cumulants ^ and ^ for the RSM, where lu'I and correspond to the values obtained from theory (Eq. 
ITITI and Ea. l332l . 

The temperatures employed in (a) were the highest temperatures at which the simulation results agreed with the predictions 
(Eq. 13.291 1 of leading order anharmonic perturbation theory. 
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Figure 3.7: Comparison of the FSM and RSM with varying temperature 

The evolution of P{AIab\t^b) "^he RSM and FSM) with temperature. The temperatures correspond to (a)T* = 0.01, 
(b)T* = 0.1, (c)r* = 0.5, (d)T* = 1.5. 

Note that in (a) and (b) the RSM essentially assumes its low temperature limiting form 11251 (see also figure lT51 (b)). 



CHAPTER 3. TUNING THE REPRESENTATIONS 



82 



Mxj^ will be considerably amplified for the FSM as compared to the RSM. In addition to this one finds that for the RSM 
which we have employed (see section l33l . intra-planar correlations are preserved 1341 on the transition from one phase to the 
other, a fact which is true even in the anharmonic regime. The FSM, on the other hand, will preserve little correlations in the 
anharmonic limit, since the anharmonic effects effectively contaminate the transformation. The net result of these two effects 
is that the RSM eventually becomes more efficient than the FSM on the transition to sufficiently high temperatures. 

An important point to note is that the deviations seen in the distributions of the FSM from the ideal limit (Eq. 13. 1> . 
obtained on increasing the temperatures, are not due to the increasing prominence of the intrinsic anharmonic effects but 
are instead due to the inefficiency of the representation 1651 . To see this we note that in the harmonic limit the FSM maps 
configurations of phase 7 onto configurations of phase 7 which are of the same effective temperature. On increasing the 
temperature the contamination of the FSM transformation by anharmonic effects results in configurations of phase 7 being 
mapped onto configurations of 7 which are effectively hotter than the typical configurations of phase 7. As a result the 
anharmonic corrections to the total excitation energy will be amplified, under the operation of the FSM, over those corrections 
that are intrinsically present in phase 7 at that temperature, so that M^^ is not truly representative of the intrinsic anharmonic 
effects. Therefore, based on an observation of P{Mab\'^b)' oi^^ naively conclude that the anharmonic effects are greater 
than they really are. This idea is supported by figure ITTl fb) where, despite the fact that at T* — 0.1 P{Mab\'^b) has 
assumed its low temperature (harmonic) hmiting form for the RSM (see also figure |^](b)), the corresponding distribution 
for the FSM exhibits a significant departure from the ideal limit (Eq. 13. 1> . In fact from figure l3T6l (a) we see that departure 
(for the FSM) from the linear scaling predictions (Eq. I3.29> of leading order anharmonic perturbation theory are observed at 
a temperature two orders of magnitude lower than that for which departures from the harmonic predictions are observed for 
the RSM. From figure ll!9l we see that the anharmonic contributions to the FED are smaller than what one would expect based 
on the observation of P{Mab\t^b) figure ITtI 

3.6.2 Estimating the FEDs 

Figure l3^ shows the full AIab probability distributions for the FSM-PS method (see section |2.4.7> : no form of extended 
sampling was employed here. It is clear that on the transition to sufficiently low temperatures, the FSM-PS method is no 
longer plagued with being constrained to the phase which it is initiated in 11271 signalling the absence of an overlap problem 
since the fiill (effective) configuration space associated with both the phases is visited. The overlap problem (for this estimator 
at these temperatures) is effectively cured. 

Fi gure U3\ show s the estimates of the anharmonic contributions to the temperature-scaled FED for the range of tempera- 
tures shown in Figure lsTSl The significant feature is the agreement of the three estimators (the FSM-EP estimator Eq. 12.281 
the FSM cumulant approximation Eq. 13.301 and the FSM-PS estimator Eq. I2.78> at low temperatures and the disagreement 
between them at high temperatures. The FSM-PS estimator indicates that the anharmonic contributions are unresolvably small 
throughout. This conclusion is consistent with what one would expect based on the extrapolation of FSM-PS measurements at 
higher temperatures (see figure lTTol . Moreover since the FSM-PS method visits (for the range of temperatures investigated 
m figure the regions of the effective configuration spaces associated with both the phases (as is clear from figure 13.81 1 
we expect that it should be free of systematic errors. Accordingly we will use the results of the FSM-PS estimator as the 
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Figure 3.8: The probability density function P{Mab\'jtps) for the FSM as a function of temperature 

a)T* = 0.00009, b)T* = 0.001, c)T* = 0.06, d)T* = 0.2 



CHAPTER 3. TUNING THE REPRESENTATIONS 



84 




Figure 3.9: fS^AF^g versus T* 
The temperature-scaled anharmonic contributions to the FED, f3'^AF^g, estimated (without the use of extended sampling) 
via the FSM-PS method (Eq. IZTTt . the cumulant approximation (Eq. l330l . and the FSM-EP formula (Eq. IZ26l . 
We have plotted /J^AFjJ^ instead of AFjJ^ since, from perturbation theory, we know this quantity should be constant at 
sufficiently low temperatures. From perturbation theory we also know that at higher temperatures contributions to Ai^^^ 
appear which scale as P^^. For this reason we have use a linear extrapolation to compare the high temperature results of 
figure ITTOI to those obtained here 1 128 1. 
AF^g is in units of k/e. 
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benchmark (albeit a rather uninteresting one) for the other methods. 

Let us start by discussing the low temperature limit. The ability of the FSM in overcoming the overlap problem (on the 
transition to sufficiently low temperatures) for all the estimators is clearly evident from figure |3^ However contrary to initial 
expectations this does not mean that the task of resolving the anharmonic contributions to FED becomes any easier To see 
this we note that for the FSM uji ^ T and uj2 ^ T (see Eq. 13.291 1. with all higher orders vanishing at a higher rate. As a result 
the error in ones estimate of the mean (uji) of P{Mab\''^%) is proportional to , since this quantity itself is proportional 
to the standard deviation ^/uj2 of P{Mab\t^b)- The fact that uji decays faster, with decreasing temperature, than does its 
error is indicative of a signal to noise problem that is present on the transition to lower temperatures. That is, in the region 
where the overlap problem is overcome the (small) anharmonic contributions are entirely masked by the residual noise in the 
transformation in a way which is not cured by going to (still) lower temperatures. 

As the temperature is increased what is observed in figure lT9l is the eventual departure of the estimates of both the cumulant 
approximation and the FSM-EP methods from the estimates of the FSM-PS method. The first estimator to depart from the 
benchmark line is the cumulant approximation (Eq. I3.30> . signalling the increasing importance of the higher order cumulants 
in the expansion of Eq. 13.281 H291. Upon inclusion of all the cumulants (which is simply done by estimating IRsyi via the 
FSM-EP method, Eq. I3.24> one does indeed estimate the quantity A_Fg^ correctly, since the results of the FSM-EP method 
and the FSM-PS method coincide. On increasing the temperature further, the estimates of 'JI^ra via the FSM-EP method also 
begin to depart from those of the FSM-PS method. The reason is that now systematic errors are arising from the fact that 
P{Mab\''^'a} ^'^'1 P{Mab\''^b) completely overlap. This is clearly the case in figure l?7Sl (d). We note that unlike the 

cumulant approximation, which underestimates the FED 1 129 1, the FSM-EP method overestimates the desired quantity when 
systematic errors begin to set in II130I . 

The decreasing amount of overlap between P{Mab\t^'a) ™d P{Mab\''^%) obtained on increasing the temperature means 
that eventually even the FSM-PS estimators will not be free of systematic errors without the use of some form of extended 
sampling strategy (see section l24l and chapter|5|l. For the systems investigated here, the maximum temperature at which the 
FSM-PS method could successfully be implemented without the use of extended sampling was T=0.2 (figure IXSt . Beyond 
this extended sampling was required. Figure lTTOI shows the temperature-scaled anharmonic FEDs obtained with the MUCA 
11311 sampling distribution and shows clear agreement between the RSM and FSM methods. The results are consistent with 
those of l35il . In particular the anharmonic contributions act so as to favour the hep (B) phase. 

The MUCA extended sampling strategy (and indeed all the other extended sampling strategies, to be discussed in chapter 
|5ji allows one to tackle the overlap problem irrespective of the representation. The choice of representation then manifests 
itself in the residual statistical errors in the estimate of the FED. In comparing these statistical errors for the FSM and the RSM, 
we first note from figure l3^ that for all temperatures of up to T=0.5, the FSM will require a narrower region of Mab space to 
be reweighted (as compared to the RSM) within the MUCA approach 11321 . This has two consequences for the simulation. 
Firstly this will translate to a smaller statistical error in the estimate of JI-ba for a given number of Monte Carlo steps since 
the system will fluctuate over a narrower region of AIab space. Secondly this will correspond to a reduced computational 
effort in the task of constructing the multicanonical weights. However on top of this one must also give consideration to the 
differences in computational effort (i.e. the time for each Monte Carlo step) between the FSM and RSM. In order to understand 
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Figure 3.10: /J^AF^^ as obtained from the MUCA FSM-PS and RSM-PS metlnods 
Note that for the temperatures investigated, MUCA weights had to be employed in order to ensure that the simulation 
able to visit the Mab ^ regions. 

The straight line denotes a linear extrapolation based on the results of the RSM-PS simulations 11281 . 
AF^^ is in units of fc/e. 
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this latter issue more fully, we note that in a MUCA simulation the macrovariable Mj^b will have to be evaluated for each 
Monte Carlo step. Suppose the simulation is in phase 7 and suppose that one employs a short ranged potential. Then a single 
particle perturbation will, for the RSM, require a local re-evaluation of the configurational energies of both 7 and 7 in order to 
compute the new value of Mab- The number of computational steps needed for such a task is 0{N^) = 0{\). For the FSM 
the re-evaluation of £^ will also be local in nature. However since the FSM (see Eq. I3.16> induces a global rearrangement of 
the atoms of phase 7 (so that the calculation of £^ is an 0{N) calculation) the number of computational steps needed for the 
reevaluation of Mab for the FSM will be 0{N). This significant advantage that the RSM holds over the FSM vanishes when 
long ranged potentials are employed, in which case both methods will involve 0{N'^) calculations. 

3.7 Summary 

In this section we have clearly illustrated both analytically and numerically the dependence of the overlap on the representa- 
tion. We have shown that adopting afourier representation of the displacements allows one to cure the overlap problem at 
sufficiently low temperatures. This is in sharp contrast to the RSM, for which the overlap of the two distributions tends to a 
Umiting form. The main benefit of the FSM over the RSM is that for sufficiently low temperatures, extended sampling will 
not be needed in order to arrive at an estimate of the FED which is free of systematic errors. 

However our expectations of being able to estimate the FED via the FSM with increasing ease are not fulfilled due to the 
presence of a signal to noise problem which gets worse as the temperature decreases, even though the overlap between the 
phase-constrained distributions improves. Furthermore by being a transformation which is global in nature, one must expend 
a considerably greater amount of computational effort in dealing with the FSM than is required for the RSM. 

In tackling the problem of estimating FEDs in the most general cases, one must not only give consideration to the choice 
of representation but one must also give consideration to the choices of estimators and the choices of extended sampling 
strategies. The importance of the choice of estimator has already been illustrated to a certain extent in figure ll!9l where we 
have seen that the PS estimator (Eq. I2.78t is better than that of the EP method (Eq. I2.28> . In chapterl^we will address these 
issues in greater depth. In chapter |5l we will then proceed to discuss the use of extended sampling strategies in the task of 
estimating FEDs. 
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4.1 Introduction 

Imperative to the understanding of the FED problem is the appreciation of the distinction that must be made between statistical 
and systematic errors (see section [7.3. 1> . Whereas statistical errors may be reduced to a desired level simply by running the 
simulation for a sufficient duration of time, systematic errors in general can not be controlled in this way. In the context 
of FED calculations the origin of these systematic errors is the partial overlap |2| between P{Mba\t^%) and P{Mba\t^b) 
(see section l23t . As we have discussed in chapter |2] these systematic errors may be minimised through efficient choices of 
PM (that is choices of the global configuration space displacement D and representation v). In the case where it is possible 
to construct an efficient PM so as to yield at least some overlap between P{Mba\tt')^) and P{Mba\t^b)' possible to 
eUminate the systematic errors that arise in ones estimate of JI-ba by constructing an appropriate estimator In this chapter we 
will investigate this issue in two stages. In the first part we will show how one may restrict the regions of Mba space which 
contribute to the relevant expectations (which appear in the estimators) so as to yield an estimate of JI-ba which is free of 
systematic errors. We will then show that an alternative strategy to this is that of employing estimators which are unrestricted, 
in the sense just described, and which are instead designed to have their most significant contributions originating from those 
regions of (effective) configuration space over which both PIMbaIt^a) ^^'^ P{Mba\t^b) overlap 11331 . This latter idea has 
been studied (and understood) in a different way in 1461 . 1531 . 1541 . I134I - I137I . At the heart of our insight is the appreciation 
that only within the region of overlap does the estimator in Ea. l2.22l vield an estimate which is free of systematic errors. Since 
ultimately all estimators may be derived directly from Eq. 12.221 it follows that the successful estimators will be those which 
pool together the estimates of JI-ba made by Eq. \2.22\ withm the region of overlap. 

Consider figure l4m In the most general case, there will be two types of overlap that one encounters when one attempts 
to estimates the FED. In the first case (see figure ITTI (a)) the regions of (effective) configuration space of one of the phases 
forms a subset of that of the other phase. This type of situation typically arises in the calculation of the chemical potential (via 
the insertion method), where one attempts to determine the FED between an N particle system (A) and an N+1 (B) particle 
system (see 1531 for an excellent discussion). In this case it has been argued 1531 that the EP estimator (Eq. 12.261 in which A, 
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the N particle system, is the parent phase) will yield an estimate of the FED which is free of systematic errors. The basic idea 
is that a samphng experiment performed in phase A will capture all the regions of (effective) configuration space relevant to 
phase B |[T38l . 

The latter type of overlap (see figure l4n (b)) appears when one attempts to determine the FED between different phases of 
a single system (see section l23t . In regards to the estimator we have already seen in section ll!6l (in particular figure IT9b how 
the estimator of the EP method (Eq. I2.28> fails in this case, due to the fact that a single phase-constrained distribution fails 
to capture all the important regions of (effective) configuration space which contribute to the FED. One must instead employ 
estimators which involve the sampling of the regions of (effective) configuration space associated with both phases iBSi . 

These estimators, which involve the simulation of both phases, may be broadly categorised into two groups: the phase- 
constrained estimators and the phase-switching (PS) estimators. The phase-constrained estimators involve expectations with 
respect to sampling distributions confined to the phase in which they are initiated, whereas the phase-switching estimator 
involves a sampling distribution which actually switches between the phases. As we will now show, one must in general take 
explicit steps so as to ensure that the phase-constrained estimators are free of systematic errors. We will also show that, for a 
particular subgroup of the phase-constrained estimators, no such steps are needed since these estimators are, by construction, 
free of systematic errors even in the case of partial overlap. In the case of the PS estimator, we show that for partial overlap 
the method can be guaranteed to be free of systematic errors simply by appropriately weighting the two phases (in a way 
as prescribed in the simulated tempering method I49I - II52I . section |2.4.6> so as to increase the probability with which the 
simulation visits the phase with the smaller partition function (greater free energy). In order to keep the discussion as general 
as possible we will formulate our arguments within the context of the EG method. 



4.2 Phase-constrained estimators 

4.2.1 Eliminating systematic errors via restricted expectations 

Suppose that P{Wba\''^'a} P{Wba\''^b) partially overlap O in the manner shown in figure l4!2l Using the same arguments 
employed in section l23] it follows that the point at which they intersect is given by: 

W^^-\n'R^A=P^FBA (4.1) 

We will now proceed to show how, in the case of partial overlap, the FED may be estimated from any estimator (see Eq. 
I2.105> in a way which is free of systematic errors, merely by restricting the range of Wba space from which the non-negligible 
contributions originate. Let us first consider the overlap identity (Eq. I2.101> . Multiplying by an arbitrary non-zero function 
G[Wba) and then integrating both sides over the restricted range W]^^ < Wba < ^ba arrive at a formula which we 
call the restricted dual phase perturbation (RDP) formula: 

^BA / G(WBA)P{WBA\Tr'B)dWBA = / G{WBA)e-^^^ P{WBA\Tr%)dWBA (4.2) 
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Figure 4.1 : Schematic of the way in which the two phases can overlap in (effective) configuration space 

a) Here the regions of (effective) configuration space typically explored by phase B are a subset of those typically explored by 
phase A. 

b) In the second type of overlap there are regions typically explored by each phase that are not visited by the other phase. 
When we refer to 'partial overlap' we will have the case shown in (b) in mind. 

In the whole of this thesis we will only concern ourselves with cases where the overlap is of the type shown in (b). 




Figure 4.2: Schematic of P(VFsA|7r^) and P{WBA\7r%) in tine case of partial overlap 
Using the same arguments as those employed in section l23l it is easy to show that the point at which the two distributions 
intersect is given by Wm = — In ^-ba- 



CHAPTER 4. ESTIMATORS 



92 



or 



where: 



^ < GiWBA)YiWBA)e-'^-- 

■^^^ < G{Wba)Y{Wba) >.j, ^ ^ 



1 : W?,, < Wba < Wl, 
Y{Wba)^{ (4.4) 
: otherwise 



Eq. I4.3l mav then be used to estimate 'JI'ra via: 



c^. Y.\ =lP{WBAA^A)G{WBA^^)Y{WBA^^)e- ^^ 
Y.\=lP(WBAAT^%)G{WBA,i)Y{WBA, 

where: 



'^s.A = zzi — ttt: : — . ^ 



PiWBAA-^"^) - ^1 (4-6) 

VT^^ and W^j^ are in principle arbitrary. In practise, however, they are not if one is to arrive at an estimator which will yield 
an estimate of JI'sa which is free of systematic errors. In order to obtain the necessary insights it is instructive to derive Eq. 
I4.5l directlv from the estimator for 3?cByi associated with the overlap identity (Eq. I2.10H itself: 

^ e.b.^ P{WbaA^'a)^-'^^^'^ ,4 7, 

Jisyi = -J^BA = ; ^ (4.7) 

P{WbaA'^'b) 

where 'Rba is an estimate for IKsa. Rearranging Eq. I4.7l and multiply both sides by 
G{WBA,i)Y {WBA,i) one obtains: 



^BAG{WBA.i)Y{WBA,i)P{WBAA^B) = G{W BA,i)Y {W BA,i)P{W baA^A^^^^-' (4-8) 

Summing both sides over all the bins and rearranging leads to Eq. 14.51 

The necessary restrictions that are needed become apparent when one notices that, implicit in this derivation, is the as- 
sumption that the histograms H{WbaA'^'a) ^^'^ HiWBAA'^%) of 'he bins Wba.i over which the summations are per- 
formed are simultaneously non-zero. This requirement stems from Eg. 14.71 which itself assumes that both P{WBA.i\T^A) ™d 
P{WBA.i\T^B ) ^1"^ non-zero. However the regions of (effective) configuration space over which the estimators P{WbaA'^a) 
and P{WbaA''^b ) °f '■^^ phase-constrained distributions are both non-zero is precisely what we defined (in section lT^ to be 
the region of overlap. In other words the widest choice ofWgj^ < Wba ^ ^ba should correspond directly to the region over 
which the estimators P{Wba\'^'a)'^^'^ P{Wba\t^'b) of the phase-constrained distributions overlap (i.e. the shaded region in 
fisureWli. 

The key point is that within the overlapping region each bin has, associated with it, an estimate of JI-ba given by Eq. 14.71 
One may then pool these estimates together in different ways; the result is the array of different estimators whose form is 
most generally given by Eq. 14.31 This idea is illustrated in figure |431 As we will show in the next section the acceptance 



CHAPTER 4. ESTIMATORS 



93 



The different estimators, which are free 
of systematic errors, then pool these 
estimates together in different ways. 

/The most well known examples of such 
estimators are the AR and FF estimators 

At each bin in the region of overlap 
the value of Rba niay be estimated 
from the overlap identity. 




Figure 4.3: Schematic illustrating the principle workings of Eq. 14.31 
This figure illustrates the fact that the phase-constrained estimators whose estimates of JI-ba are free of systematic errors are 
those whose non-negligible contributions solely come from the region of overlap. 



ratio (AR) and fermi function (FF) are prime examples of estimators which pool the estimates in this way and which do not 
require any restrictions (see Eq. 14. 4t to be imposed. The phase switch (PS) method is another such method, which accounts 
for all the regions of (effective) configuration space which contribute non-negligibly to the FED by actually switching phases 
(in the case of zero equilibration, see section l2'.4.7> or more generally switching between processes (in the case of the arbitrary 
equilibration EG method, see section l2'.4.9l l. The PS method will be discussed later in section l431 

Some flexibility does exist in setting the range Wgj^ < Wba < M^s^i- SpecificaUy the range W^^ < Wba < W^sa '^^^ 
be widened so as to include Wba macrostates originating from outside the region of overlap, provided that they contribute 
negUgibly to the relevant estimators. Since from the overlap identity (Eq. I2.101> we know that: 

P{WBAWB)'^e-'^^^P{WBAWA) (4.9) 

it foHows that regions which are negligible to the estimator of the numerator of Eq. I4.5l are also negligible to the denominator, 
and vice-versa, which makes them easily identifiable. For example, in the case where G{Wba) = 1, these regions are those 
over which P{Wba\''^%) is negligible. This includes all the parts of P{Wba\t^'x) which do not overlap with P{Wba\i^%), 
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which means that in the task of estimating 'JI'sa from Eq. l4.5l one may remove the upper restriction W^y^ (when G{Wba ) = 
1). The sole purpose of this is merely to simplify the evaluation of the relevant estimators (see section l4~4t . The important 
point is that the regions which do contribute non-negligibly to these summations are limited to the regions of overlap. 

4.2.2 Eliminating systematic errors via G{Wba) 

We have seen in the previous section how one may construct an estimator based on a given G{Wba) which is free of systematic 
errors merely by restricting the expectation, so as to ensure that the Wba macrostates which contribute non-negligibly to the 
estimator come from within the overlapping region (see figure l4!2l figure l43t . Since the most significant contributions to the 
numerator and the denominator of Eq. 14.31 come from the same regions of (effective) configuration space (by virtue of Eq. 
14.91 1 we see that an alternative strategy is to construct a G{Wba) so as to ensure that the non-negligible contributions originate 
from the regions of Wba space over which P{Wba\''^'a) and P{Wba\i^%) overlap. In this case the restrictions imposed on 
the expectations in Eq. I4.3l mav be lifted. 

In order to facilitate our analysis let us define a set of weight functions (not to be confused with MUCA weights) for the 
estimator of 'JI'ba- In the case of the DP estimators (Ea. l2.105t let us define a weight function WniWsA) as: 

Wn{WBA) - G(W^sA)e-^-^P(T^i3Akl) (4.10) 

and a weight function WdiWsA) as: 

Wd{WBA) ^ G{WBA)P{WBA\Tr%) (4.11) 

These weight functions essentially measure the contribution of a macrostate Wba in the numerators and the denominator of 
the estimator of Eq. 12.1051 Since from the overlap identity (Eq. I2.101> we know that: 

Wn{WBA) = ^^AWd{WBA) (4.12) 

we will only concentrate on Wn in the following analysis of the DP estimators. 

In the case of the EP estimator, which is obtained by setting G{Wba) = 1 in Eq. 12.1051 we will depart from the 
definitions given in Eg. 14. 101 and Eg. 14.1 H and instead define the weights in accordance to the contributions of macrostates to 
the numerator and denominator of the corresponding estimator: 

1:\^iH{WbaA'^a) 

That is we define the weights as: 



WniWBA) = e-'^^^P{WBA\T^A) 

CX P{WbaU%) 



(4.14) 
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and: 



WdiWsA) = PiWBAlTr^A) (4.15) 
In this case one must separately analyse Wn and Wd since they are no longer proportional, as is the case in Eq. 14.121 

Let us now motivate the construction of estimators in which no restrictions of the form of Eq. 14.41 are needed. From Eq. 
14.91 it is clear that the choice G{Wba) = 1 in Eq. 12. 1051 results in the most significant contributions originating from the 
regions where PiWBAlT^s) most significant. On the other hand choosing G{Wba) — e^^^ (this merely corresponds to 
performing the EP method in the other phase) results in the contributing regions being those over which P{Wba 1"^%) most 



significant. For reasons mentioned in section 14.2.11 both these choices can only be guaranteed to yield estimates of 3?syi 
which are free of systematic errors (in the case of partial overlap) by imposing the restrictions mentioned in the previous 
section. One may, then, naively expect that the construction of an interpolation G{Wba) = [1 + e^^-*]/2, which leads to the 
following formula: 

- <|l + ..w„| - (4.16, 



to lead to a more useful estimator of JI'ba- This is not in fact the case. To see this we first notice (from Eq. I4.1Q> that the 
weight function w„(W^ba) is given by: 

WniWBA) = [1 + e-'^^^]P{WBA\7T'A) (4.17) 

SO that: 

' PiWBAW^A) ■■ forWBA>0 

Wn{WBA) ~ { . (4.18) 

^^APiWBA\TT^B) ■■ for Wba <0 



It is immediately apparent from Eq. I4.18l and figure|42|that the regions of Wba space which contribute significantly are those 
regions of Wba space spanned by both P{Wba\'^'a) P(WsaKb), simply limited to the regions of Wba space 

over which the two phase-constrained distributions overlap, as one might originally expect. Therefore Eq. I4.16l has not got the 
desired property that we are looking for, namely the property of having the non-negligible contributions coming solely from 
the region of overlap. 

Now let us examine the choice of G{Wba) = A{--Wba), which leads to the acceptance ratio (AR) formula (Eq. l2.104t . 
As we will now see, the AR formula is a prime example of an estimator which does not require restrictions to be imposed on 
its corresponding estimator in order to guarantee that it is free of systematic errors. In order to see this, let us estabUsh the AR 
formula in a slightly more general way. To do this we first re-write the overlap identity (Eq. I2.10H as: 

G{Wba - C)P{WBA\7r'B)^'BA = e-^'GiWBA - C)e-[^«^"^lp(I^BA|7rl) (4.19) 
where C is an arbitrary constant. Integrating both sides over Wba and rearranging gives: 

„ < G{Wba - Cje-I^^^-c-] > 

^23^ = e""^ 'r^n,, r^w ' (4.20) 

< G{Wba - C) >^e 
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Following our earlier definitions (see Eq. I4.10l and Eq. 14. lU we define the weight functions as: 

WniWBA) - G{Wba - C)e-[^«-^-^lp(T4^BA|7r^) (4.21) 

and Wd{WBA) as: 

WdiWsA) = G(Wba - C)P{WBA\7rB) (4.22) 
The interrelation between the weights may now be written more generally as: 

WniWBA) - e^:R^AWdiWBA) (4.23) 

Once again it suffices to focus ones attention on only one of these weight function (which in the following analysis will be 

Wn)- 

The constant C in Eq. 14. 201 is important in that it directly affects the statistical and systematic errors associated with the 
corresponding estimator. We will return to the optimal choice of C later If one substitutes G{Wba) = [W^ba ^ G]) into 
Eg. ion one obtains a generalisation of the AR formula, Eq. 12.1041 

„ < A{Wba - G) >^c 

^"-^^ <A(-[»-..-ci)>:, 

whose weight function is given by: 

Wn (Wba) = A{Wba - G)PiWBA\Tr'A) (4.25) 

It then follows that: 

{e^^^ApiWBAln'e) for Wba > G 
e^^^ApiWBAlTT's) = P(M^BAkl) forT^sA = G (4.26) 
PiWBA^'A) for Wba <G 

Referring to figure l4~4l (a) and (b), it is clear from Eq. I4.26l that if C lies within the overlapping region, then the regions which 
contribute most significantly to the estimators of the expectations appearing in the numerator and denominator of Eq. 14. 241 are 
those over which P(VFByi I'l'^i) and Itt^) overlap. In the case where the FEDs are small, it suffices to use the original 

AR formula Eq. 12.1041 since W,„ w 0. 

Therefore the AR method is only free of systematic errors for choices of C lying within the overlapping regions. If the 
value of C lies outside this region, it is not hard to see that the regions of (effective) configuration space which contribute 
the most significantly to Eq. 14. 241 will no longer be contained entirely within the region of overlap. As a result restrictions 



will have to be imposed on the corresponding estimator (for reasons mentioned in section l4.2.U in order to guarantee that the 
associated estimate of JI-ba is free of systematic errors. We will now discuss another estimator which, like the AR method, 
has its most significant contributions originating from the region of overlap and which is unique in that it is the estimator for 
which the statistical variance is a minimum. 
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a) P(Wba) 




W„ C 



Figure 4.4: The weight function Wn{WBA) for a given C. 
Figure (a) shows the portions of the distributions which contribute to the estimate of the FED when C is displaced from W, 
Figure (b) shows the resulting weight function Wn{WBA) for the AR method (see Eq. l4.26> . 
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4.2.2.1 Minimising the statistical errors : Bennett's fermi function estimator 

The AR formula is but one of a family of estimators for which the regions which contribute most significantly correspond 
to the overlapping regions. The question that one may now proceed to ask is which, out of the family of these estimators, is 
the one whose corresponding estimator for AFba is of minimum statistical variance II139I . The task of finding a minimum- 
variance estimator has been tackled by Bennett 1461 . We wiU now present his estimator within the more general context of the 
fast growth (FG) method. Consider the following choice for G: 

G{Wba-C) = f{-[WBA-C]) (4.27) 
where f is the fermi function. Substitution of EQ. l4!27l into Ea. l4!20l vields l46l . l54l . fT40l - fT42l . fT43l : 

< /{-[Wba - C]) >^|^ 

where we have used the fact that: 

/(^) 



e-=" (4.29) 



/(-^) 

Bennett showed that the choice of G and of C which lead to a minimum variance estimator of AFba is that of Eq. I4.28l in 
which C is set to be: 

C = W,„ - In — (4.30) 

riB 



and where are the number of independent data samples obtained in phase 7. Eq. I4.30l has the following simple physical 
interpretation. When Uy^ < ub, then C > Wm so that (with respect to the case C — W,n) increasing amounts of the tail 
of P{Wba\t^a) included in the contributions that come from the overlap region, whilst a smaller proportion of the tail of 
PiWBAlT^B) is included. The reason for this is that the statistics of the tail of P(WBA|7r5i) will be a lot worse than that of 
-P(W^Byi I'l's), and therefore it makes sense to take contributions from a larger proportion of its tail and a smaller proportion of 
the tail of /-"(Wsyi [tt^). When ua > nB the opposite is true. That is since C is now less than Wm, a larger proportion of the 
tail of P(M^BA Itt^) is taken into account, whereas a smaller proportion of the tail of P{Wba\t^a) contributes, thus balancing 
the fluctuations contributed by P{Wba\t^a) PO^baIt^b) estimate of the FED. This is illustrated in figure l431 

In order to estimate the FED from Eq. 14.28! and Eq. 14.30! Bennett's prescription involves iteratively solving the set of 
equations: 

Ef=lH{WBAA^B)fi-[WBA,^ - C]) ns 

and: 

C = Wm~ln— (4.32) 
Ub 
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Figure 4.5: Schematic illustrating the idea behind Bennett's prescription for C (Ea. l4.30> 

According to Bennett 1461 the optimal estimator is given by Eq. 14.281 where C is given by Eq. 14.301 This choice of C has 
a physical interpretation. In short, when phase A is sampled less well than phase B, then the choice of C ensures that one 
includes an increasing proportion of the distribution of phase A in estimating the FED in order to compensate for the increased 
statistical errors associated with the estimator P(Wsyi|7r^). This is illustrated schematically in figure (a). On the other hand 
when phase B is less well sampled, then the opposite is done, so that one includes a greater proportion of [tt^). This 

is shown in figure (b). 
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where Wm is the estimate for Wm (see Eq. 14. U . That is one starts off with an arbitrary estimate of Wm, say unity. One then 
uses Eq. l4.32l to calculate a value of C, which one then substitutes into Eq. 14.311 From this one obtains a new estimate Wm 
which one then substitutes back into Eq. l4.32l to get yet another value of C. This value of C is then fed back into Eq. 14.3 H and 
one continues this procedure until convergence is obtained. That is the process is carried out in an iterative fashion until the 
value of Wm obtained from Eq. 14. 31 I f or a particular value of C also agrees with Eq. 14.321 At this point Wn yields (in the case 
of partial overlap between P{Wba\t^a) ^'^'^ P{Wba\'^b)'^ a minimum variance unbiased estimate of the true value of Wm- 

Wm Wm (4.33) 

Like the generalised AR formula (Eq. I4.24t . no restrictions are required (when C does not differ too greatly from Wn) in 
order to ensure that the associated estimate is free of systematic errors (in the case of partial overlap). To see this we first note 
that the weight function w„ (Wba) is given by: 



WniWBA) = 



P{WbaK) 

PiWBA\n%) 

P{WBAK) PiWBA\7r%) ^4 34^ 



P{Wba\t^%) , p^w„ , |_-c \ 
e^SByi ^ ^(yVBAlTrB) 



where in going from the first to the second line we have employed the overlap identity (Eq. I2.10H . 

From Eq. 14.341 we see that (provided C does not differ too greatly from Wm, so that c'-^JI-ba ^ 1) the regions of 
Wba space for which the weight function wa{Wba) is most significant are those regions over which P{WBA\TrA) '^'^^ 
PiWBAlT^s) overlap (the shaded region of figure 1431 . Therefore like the generalised AR formula (Eq. 14.241 1. the fermi 
function (FF) formula (Eq. I4.28t is free of systematic errors (provided C does not differ too greatly from Wm)- 

Let us now analyse the case where C differs significantly from Wm- If e'-^'^'BA is considerably different from unity then it 
follows that the weights Wn{WBA) may be approximated by: 

A J HWbaItt^a) ■■ iie^^^A»l 

WniWBA) oc < ^ (4.35) 

P{WBA\n%) ■- ife^3ls^«l 
Therefore if C is too large, then the Wba macrostates contributing to the estimators of Eq. 14.281 correspond to the regions 
of Wba space for which P{Wba\t^a) significant. In this case the estimator of Eq. 14.281 will not yield an estimate free of 
systematic errors, since the contributing regions no longer come from the regions of overlap, and the necessary steps outlined 
in section 14.2.11 will need to be taken. Likewise if C is too negative, then the important regions will be those for which 
-PIW^baKb) is significant, once again leading to systematic errors. From this analysis we also see that in the limit of the 
number of independent samples obtained in the two phases (ua and ub) becoming very disparate, the Bennett prescription 
for constructing the optimal C, given by Eq. 14.301 will lead to systematic errors for the reasons that we have just mentioned. 
In his paper 1461 however, Bennett does advocate the use of an equal number of independent samples in each phase, so that 
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C = W,n- This choice leads to the contributions coming from the regions of overlap, resulting in the estimator of Eq. 14.281 
yielding estimates which are free of systematic errors. Further insight into Bennett's approach (Eq. I4.28l and Eq. I4.30> can be 
obtained by noticing the links that exist between the method and the task of estimating the overlap parameter O (Eq. I2.18> . 
We refer the interested reader to appendix|3for the relevant discussion. 

4.3 Phase switch estimator 

In section l2'.4.9l we saw how the PS method could be generalised so as to be applicable within the framework of the (arbitrary 
equiUbration) EG method. In order for the method to work K^) and K^j) should be non-zero for the Wba ~ 

regions. It is only in this case that a simulation initiated in either of the phases will be able to reach the Wba ^ regions, 
from which it will have a non-negligible chance of switching phases. In the general case where the FED differs significantly 
from 0, the estimators P{Wba\''Ta) ^(W^saKb) ^^^^ '^^^ ^^'^^ °f ^^^^ (^^^ figure l4!2l . In this case one might 
find that the Wba ^ regions are not visited, thus preventing the simulation from switching phases. As we will now show, 
provided P(WByi|7r^) and P{Wba\t!'%) overlap, one may make a slight modification to the method so as to allow it to 
switch phases. The basic idea is to weight the two phases so as to increase the probability of the simulation visiting the phase 
with smaller partition function (or larger free energy). We are essentially performing the ST tempering for the case of two 
sub-ensembles, within the more general context of the EG method. 

We recap that in its most general form the Wba distribution of the PS method may be written as: 

P(T^SAkps) = PiWsAXA^BlTT'ps) + PiWsAXB^AlTT'ps) (4.36) 

where: 

P(W^BA,C7-7kp5) = PiWBA\7r;)P{C^^^,7r'ps) (4.37) 
In the particular version employed in Eq. 12.761 we have P{(y^ij, TTpg) = Z^. In general, this quantity may be arbitrary. It 
then follows that the acceptance probabiUty for switching phases (strictly processes) is given by: 

P (r U'^ \ Min/I PiWBA,CB^A\TT'Ps) ^ 

= Min{l,^^g^=AKk) (4.38) 



Since iK^yi is unknown a-priori, it is convenient to factor it out of the acceptance probability of Eq. 14.381 Therefore we 
conveniently write PiC-y^jlT^ps) ^s: 

P{C,^,\n'Ps)- ^ (4.39) 

where are some arbitrary weights, which are known a-priori. Substituting Ea. l4.39l into Ea. l4.38l we obtain: 

PaiCA^B ^ Cb^aItt^s) =Min{l,e-'^--+'"{'^«/"'->} (4.40) 
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or: 



where: 



, Min{l,e-^«'^+'^| : if7=A 
Min{\,e^^^-^} : if7=B 



C = In — (4.42) 

WA 



Running the argument in reverse it follows that if one adopts the PS acceptance probability given in Eq. 14.401 then the 
absolute probability of finding the simulation in the 7^7 process is given by Eq. 14.391 Therefore if one implements a 
FG-PS simulation in which the phase (or process) switching probabilities are given by Eq. I4.41l then 'SI'sa may be estimated 
via: 



•BA 



wbP{Ca~.bWps) 

< Oy^A >7r|.s 



where: 



1 : if simulation in 7 ^ 7 process 
^7,7 = <( (4-44) 
: otherwise 

Eq. I4.43l merelv expresses the fact that JI-sa may be estimated by the weighted ratio of the times spent in the two processes. 
An alternative expression for H-ba may also be found which expresses Ea. l4.43l as an expectation over the macrostates Wba- 
By substituting Eq. l4.37l into Eq. I4.36l and by appealing to the overlap identity (Eq. I2.10U one finds that: 

P^.. , I c , , [l + e~'^--+^]-'P{WBA\nj.s) ■■ l^A 

P{WBAX^^=tUps)^ { , , , , . ^^-^^^ 



Using Eq. I4.37l and Eq. I4.45l we see that Eq. I4.43l mav instead be written as 



•BA — 



-C 



J P{WBA,(:B^A\7T'ps)dWBA 



J P{WBA,CA^B\TT'ps)dWBA 

_c <![! + e'^---(']-^PiWBA\7r%s)dWBA >.^, 

< J[l + e~^BA+C^^lp(^WBA\7T'ps)dWBA 



(4.46) 



_^ < fiWBA - C) >,c 



The close resemblance of this estimator with that of the FF method (Eq. I4.28> is striking. However the estimator of Eq. 14.471 
is markedly different from that of Eq. 14. 281 in one respect. To see this consider the weight function for the numerator and 
denominator of the estimator of Eq. 14.471 
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WniWsA) = fiWsA - C)PiWBA\7r'ps) = P(VKba, CiS-A^Ps) 

cx P(t^BAkB) (4.48) 



and: 



WdiWBA) = fi-[WBA-C])PiWBA\Tr'ps)^ PiWBAXA^Bln'ps) 

CX PiWBA\n^A) (4.49) 

We notice that whereas in the case of the DP estimators (Eq. 12.1051 1 the weights w„ and Wd are directly proportional to each 
other (see Eq. I4.23> . in the case of the PS estimator they are not. For the DP methods, the contributions to the estimators of 
the expectations appearing in the numerator and denominator come from the same region of (effective) configuration space, 
though the sampling distributions actually employed are different. In the case of the PS method one employs the same 
samphng distribution for the two expectations; though now the contributions to the two expectations come from different 
regions of (effective) configuration space. Whereas the DP methods can prevent the appearance of systematic errors by 
ensuring that the non-negligible contributions come from the region of overlap, the PS method avoids systematic errors by 
actually switching phases and separately sampling each phase. As with the DP methods the correct choice of C must be made 
in order for the PS method to work. 

In order to address the choice of C we note that if the weights wa and wb are the same for the two phases (so that C=0, 
Eq. I4.42L as is the case in the original PS formulation (see section l7.4.7l and section l7.4.9> , and if the region over which the 
two phase-constrained distributions overlap (see figure l4!2t is sufficiently displaced from the origin, then it is clear that the PS 
sampling distribution will not be able to successfully switch between the phases in both directions. The way to remedy this is 
to choose a C which lies within the region of overlap (see figure l4!2t . In particular if one chooses: 



C = Wm (4.50) 

then from Eq. l4.43l one finds that: 

< ^7,B >7r1 



'PS 



1 (4.51) 



SO that the simulation spends an equal time in the two phases. By setting C as prescribed in Eq. 14.501 what one does is to 
effectively bias the phase with the larger free energy, so as to increase the probability with which the simulation visits it, as 
compared to the case where C is set to unity. 



4.4 Numerical results 



We saw earlier in section l4.2.1l how any estimator can, in principle, be modified by imposing appropriate 'restrictions' so as to 
guarantee that the resulting estimate of the FED is free of systematic errors when there is some overlap between the estimators 
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of the phase-constrained distributions P{Wba\t^a) P{Wba\t^b)- ^^^^ section we illustrate the application of these 
restrictions in the case of the EP estimator and compare the resulting statistical errors to those of the PS, AR, FF, and EP 
methods. 

In order compare the estimators we used the same simulation setup as that used to obtain the data of figure ll!9l r i44l . 
This represents the rather uninteresting case of estimating JI-ba when its assumes a value of approximately unity. However 
it is useful for the reason that, since the value of JI-sa hardly changes for the range of temperatures investigated, one may 
effectively probe the behaviour of the statistical errors purely as a function of the overlap; the overlap being changed simply 
by varying the temperature. 

We start by recalUng (see section |4. 2. 1> that in the case of G{Wba) — 1 the form of Eq. 14.31 mav be simplified by 
discarding the upper limit W^j^. Furthermore since JI-ba (and therefore Wm) varies negligibly over the range of conditions 
investigated here (see the results of the FSM-PS method in figure IT9t . it is convenient to set Wgy^ to W„i. The result is that 
in the case of zero equilibration Eq. I4.3l reduces to: 

<e-^--H{WBA-Wrn)>^^^ 

= < mwBA w„.) ^"-'"^ 



where: 



, 1 : a; > 

H{x) = { (4.53) 
: otherwise 



Using the fact that Wm ~ for the conditions investigated here, we see that in the case of zero equilibration Eq. 14.521 
simplifies to: 

< e-^'''^ Hi Mb a) 

^■ZA = 7 — ^ ' ^ (4.54) 

<H{Mba)>.^^ 

We note that though Eq. 14.541 does omit some of the region of (effective) configuration space over which the two phase- 
constrained distributions overlap, and therefore has greater statistical errors than it would if all the regions of overlap were 
included, its advantage lies in its simplicity and in the fact that Eq. I4.54l mav be used for the spectrum of overlaps investigated 
without requiring one to modify the restrictions as the overlap changes. It is for this reason that we will use Eq. I4.54l in our 
comparison of the estimators. 

Figure l4!6l shows the statistical errors (and the associated errors of the errors) in the estimates of '51'ba for the different 
estimators as a function of the overlap parameter O. In comparing the different estimators we once again use the PS estimator 
as the benchmark. The first observation that we make is that the AR and the FF estimators (the latter of which is not shown in 
figure l4~6l since its results were identical to those of the AR method) yielded statistical errors of roughly the same size as those 
of the PS method for the whole range of overlaps O investigated. The EP estimator, on the other hand, yielded a markedly 
different behaviour to that of the PS estimator. 

For high values of overlap O the EP estimator clearly yields roughly the same statistical errors as those associated with 
the PS estimator, whereas for low overlaps the errors of the EP estimator are significantly greater This may be qualitatively 
understood by noting that for high overlaps, the distributions P{Mba\''^'a) ^i^d P{Mba\''^'b) sufficiently overlap so as to 
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ensure that the statistics of the regions which contribute to both the numerator and denominator of Eq. 12. 281 are good under a 
samphng experiment constrained to a single phase. In the case of the PS method, the statistics of the macrostates relevant to 
the numerator and the denominator (Ea l2.78> are good because the method (by switching phases) separately visits the regions 
of (effective) configuration space associated with the two phases. 

As O decreases, the two phase-constrained distributions increasingly separate (see figure ISTSt until the point is reached 
where the main body of P{Mba\'^b) resides in the tail of P{AIba\t^a)- Under these conditions even though systematic 
errors will not be present for the HP estimator, the statistical errors will be greater than those associated with the PS estimator 
since now the macrostates which contribute to the numerator of Eq. I2.28l will be visited with a small probability, even though 
they contribute significantly to the estimate of the FED. On the other hand the PS method visits macrostates with probability 
in direct proportion to their contribution to the relevant estimator (see Eq. I2.76> . As a result the statistical errors of the PS 
estimator in this case will be lower than that of the EP method. For overlaps even lower than this, the systematic errors will 
begin to set in for the EP estimator, since the regions associated with both phases will not be visited (as is required) by a 
simulation constrained to a single phase, even though systematic errors will not be present for the PS method (see also section 
13. 6> . For the experiments conducted here, it was found that systematic errors begin to set in for the EP estimator for overlaps 
below O ^ 0.3. 

In contrast to the EP estimator, the estimator of the REP formula (Eq. I4.54> does not suffer from any systematic errors for 
the whole range of overlaps investigated, and its statistical errors were only marginally greater than those of the AR, PS, and 
FF estimators. The main reason for the increased statistical errors is because the restriction, as imposed in Eq. 14.541 excludes 
some of the overlapping region. That is some of the 'useful' contributions are unnecessarily discarded, resulting in slightly 
higher statistical errors. It is also for this reason that at high overlaps the error of the EP estimator falls below that of the REP 
estimator. We stress that this property of the REP estimator is merely an artifact of the particular version of the restrictions 
we employ in Eq. 14. 541 and that, upon inclusion of all the region of overlap, the statistical errors should fall to roughly those 
of the other methods. 

We finally note that in between the high and low overlap regimes, there is a range of overlaps for which the EP method 
is free of systematic errors and yet for which the statistical errors are greater than those of the REP method, despite the fact 
that the particular version of the REP that we use discards some of the data originating from the overlap region. In figure 
14.61 this roughly corresponds to the range 0.3 < O < 0.8. To understand this we recall that as the overlap is increased, 
systematic errors will disappear for the EP method on the onset of the main body of P{Mba\t^b) being contained in the tail 
of P{Mba\'^a)- However in this regime the statistical errors in the estimate of JI'ba will be large since the statistics of the 
regions over which P{Mba\t^b) significant will be poor, since this is contained in the tail of P{Mba\t^'a)' ^^^^ offsetting 
any gains it has over the REP method. This drawback of the EP method will reduce as the overlap increases, until eventually 
the EP method becomes more efficient than the version of the REP formula that we use. 

From this numerical study it is clear that statistical considerations lead one to the conclusion that in the case of partial 
overlap (see figure l4n (b) and l4.2> . estimators which involve the accumulation of data from both the phases (e.g. the PS, 
AR, FF, and REP estimators) are preferable to those that involve estimators which use the data acquired from a single phase 
(namely EP method). Out of these estimators, the PS, FF, and AR method are preferable (to the REP method) since one does 
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not need to expend the additional effort of restricting these estimators (as is done in Eq. 14.31 1. In comparing the AR, FF, and 
PS methods we note that the PS method requires a single simulation to estimate 'JI-ba, whereas the FF and AR methods both 
require two separate simulations. This latter property can be viewed as an advantage for both groups of methods. On the 
one hand it affords the FF and the AR method an avenue for paralleUsation which is not available to the PS method, since 
the two phase-constrained simulations may be performed independently. On the other hand the ability of the PS method to 
estimate 'JI-ba from data extracted from a single simulation makes it, in some sense, tidier. Another important difference is 
that the adjustment of C, so as to yield an estimate of JI-ba which is free of systematic errors when there is partial overlap, 
have to be made before the simulation is run in the case of the PS method. In the case of the FF and AR methods, these 
adjustments are made after the simulation is run, when one is trying to estimate "R-ba from the data already obtained; this may 
be easily automated. Since a-priori we do not know where the region of overlap is, it is clear that in this case the FF and AR 
methods have an advantage over the PS method. This, however, is not a significant advantage since one may run two short 
simulations, one in each phase, in order to roughly determine the point M,„ where P{Mba\t^'a) and P{Mba\i^'b) intersect, 
thereby yielding an appropriate value of C (see Eq. I4.50t . 

4.5 Conclusion 

We saw in section 12.31 that central to ones ability to estimate the FED is the concept of overlap between the estimators 
P{Wba\i^'a) aiid P{Wba\t^b) of the phase-constrained distributions. For systems characterised by a (effective) configuration 
space structure as shown in figure l4Tn (b'). successful estimators based on the sampUng of the phase-constrained distributions 
will have their most significant contributions originating from the region of overlap |46|, US, |54|, |62|, C34|. The way 
that we have realised this idea is by appreciating that all estimators are based on the overlap identity (Eq. I2.101> . Since the 
corresponding estimator of the overlap identity, Eq. 14.71 is itself only valid within the region of overlap, we see that estimators 
which are free of systematic errors can only have their non-negligible contributions coming from this region. In a sense one 
may think of these estimators as pooling together the estimates of 'Rba, as made by Eq. 14.71 from within the region of overlap 
(see figure l43t . 

An alternative, and equally suitable, strategy to the phase-constrained simulations is the PS strategy yj in which one 
actually switches between phases (or more generally between A ~* B and B A processes in the case of the arbitrary 
equilibration FG method). In this case the method overcomes the problem presented by partial overlap by actually switching 
between the phases (or processes) thereby sampling each phase (process) separately. 

Generally however, the scope for refinement of the estimator is limited. In the absence of overlap one must resort to some 
sort of extended sampling strategy 1211 in which one engineers overlap by forcing the simulation to visit regions of (effective) 
configuration space which it would not otherwise sample (under the influence of the canonical sampling distribution Eq. I1.46> . 
This, after the choice of representation (chapter|3} and the choice of estimator (the present chapter), forms the final part of the 
overall strategy of tackling the overlap problem. This topic of discussion will form the core of the next chapter. 
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Figure 4.6: Errors in JI-ba as a function of tine overlap parameter O 
The same amount of computational resources were allocated to all the simulations. The range of the temperatures employed 
here were the same as those in figure ll!9l and the FSM was employed. We tested 5 estimators: the AP, EP, FF, REP, and PS 
estimators. C=0 for all the estimators. The results of the AR and FF estimators were identical, and are denoted by a single 
line. O is given by Eq. 12.191 

All the simulations were zero-equilibration EG simulations (i.e. where Wba = Mb a)- 
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Sampling Strategies 

5.1 Introduction 

In the case where one is unable to construct a PM which ensures some overlap between the two phase-constrained distributions, 
one must engineer overlap by refining the sampling strategy II21I . Studies until now have focused on methods which fall into 
one of three broad categories: 

1 . They sample from some form of extended sampling distribution II21I and extract the FED via an appropriate reweighting 
scheme (see Eq. I1.32> . The extended sampling strategy involves the employment of a non-canonical sampling distribu- 
tion so as to allow the simulation to visit wider regions of (effective) configuration space than it normally would under 
the canonical distributions. Such methods include Umbrella sampling 1451 . 1621 . I69I - I71I . PS method |2, I34I - I37I . 
Simulated Tempering |49l-(52l, and the Weighted Histogram Analysis Method ||54l, l9n-l94l. 

2. The fine tuning of the Fast Growth (FG) Method 1551 . 172I - I75I . 11081 . By making the incremental perturbations to the 
configurational energy (which constitute the work elements of the process, Eq. 12.931 1 sufficiently small and by choosing 
sufficiently long equilibration times between these work elements, this method allows for the engineering of overlap 
between P{Wba\i^'a} ^^d P{Wba\t^'b) (see section lS^ . 

3. They split the calculation of 'R-za (Eq. I2.13> into many small and separate FED calculations, between pairs of systems 
whose phase-constrained distributions overlap considerably better than that exhibited by the original pair of systems. 
These methods are generally referred to as the multistage (MS) methods |45l, |47l, |48|, (62], (69l-(7ll, 11451 . In the 
limit of an infinite number of stages we arrive at the thermodynamic integration method (Eq. I2.52> I23I - I25I . 

In this chapter we will study these three strategies in the following manner First we will deal with point 1 by showing 
how the EP, AR, and PS methods can be made to work by appealing to the MUCA extended sampling strategy, as described in 
section l2'.4.3l The generalisation of this strategy to the case of an arbitrary estimator (Eq. I2.34t is straightforward. Following 
this we construct a new way of estimating the FED in which one employs a series of parallel simulations (as is the case for 
the WHAM method, section l2'.4.5> . In certain limiting cases this new method may be thought of as a realisation of both the 
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FG method (point 2) and the MS method (point 3). In the final part we illustrate point 2 by applying the FG method to the 
model systems under consideration here (see section l3T2t . and show how it overcomes the overlap problem by means of the 
fine-tuning of the relevant parameters (see sections l2.4.8l and l5.4> . 



5.2 The Multicanonical strategy 



The MUCA strategy 11461 is a serial strategy (used in the case of zero-equilibration FG simulations, Wba=Mba) and 
involves, as we saw in section U^.SI and B^.TK see figure l23t . the 'warping' of the canonical distribution so as to produce the 
necessary bridging distribution 11471 . Suppose that vr"^ and vr™ denote the canonical and the MUCA sampling distributions 
respectively. By accepting moves via 11481 : 

-/3£(v')-r,(AfBA(v')) 

PJv-> v'Itt™) =Min{l, -Tor-. — , I (5.1) 

one realises: 

P(MB^|7r")=P(MB^|7r^)e-''(^«^) (5.2) 
In order for P(MB^|7r™) to he flat over the desired regions of (effective) configuration space one sets: 

In this section we will use the Wang-Landau 1851 method to obtain the weights (see section l2.4.3t .and we will focus (our 
discussion) on three estimators, the EP, AR, and PS estimators. Generalisation to the general estimator of Eq. I2.34l is straight- 
forward. 



5.2.1 The Exponential Perturbation estimator 

In section l2'.4.2l we saw that it was the failure of a simulation constrained to a single phase to account for the typical config- 
urations of both phases which ultimately led to the failure of the EP estimator, even in the case of partial overlap. That is a 
sampling experiment performed in phase A (via tt^) only samples the macrostates for which the weights Wd{MBA) (see Eq. 
I4.15> are non-negligible, and fails to capture all the regions of (effective) configuration space for which ?«„(Mba) (see Eq. 
I4.14t is non-negligible. 

The way this problem is remedied (see 1621 and section l2'.4.3t is by constructing a MUCA distribution P(AfByi |7r™) which 
contains P{Mba\t^a) ^^'^ P{M Itt^), so that the simulation visits the regions of (effective) configuration space associated 
with both phases. 

5.2.2 The Acceptance Ratio estimator 

Consider the use of the AR formula (Eq. 12. 3H in the absence of overlap (see figure lZ2t . In this case the MUCA strategy 
involves the construction of two separate MUCA distributions tt^* and tt^. tt^' has to sample all the macrostates Mb a 
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which contribute non-neghgibly to < A{MBji) >tx'\ and tt^ has to sample those which contribute non-negligibly to < 
A{—Mba) >7r^- As we saw in section 14.2.21 these correspond to the regions of overlap. However, unlike the case where 
the estimators P{Mba\'^'a) ^i^d P{]^Iba\'^'b) overlap, in the case where they do not overlap it is not clear a priori where 
these regions are. To determine them one may plot a graph of the weight function WniMBA) versus Mba, as one constructs 
the MUCA weights. Once the MUCA distributions P[Mba\i^^a ) and P{Mba\''^^) are wide enough so as to contain all the 
regions over which w„(Mb^) is non-negligible 1 149 1 one may then proceed to estimate iK^yi via: 

e^. T.UA{Mba,.)P{MbaA^a) (5 4) 

Y!'i=lA{-MBA,i)P{MBAA'^%) 

where: 



P(Mba,.|<) = . ' • ' ^/ . (5.5) 



5.2.3 The Phase Switch estimator 

We saw in section l2'.4.7l that for the PS method 3lsyi may be evaluated by appeal to Eq. 12.801 This identity corresponds to a PS 
simulation in which the probability of switching phases is given by Eq. 12.751 In this case it is clear that a PS will have a non- 
negligible chance of being accepted only around the Mba ^ 0{1) regions, and therefore the MUCA sampling distribution 
should ensure that these regions are visited by the simulation. In this case (see section 12.4. 7> a suitable MUCA distribution 
P{MbaWps) is one which is flat and which contains both P{Mba\'^a) P{Mba\t^'b) Q, IED-OtI. If H{Mba,iWps) 
denotes the number of data entries falling in bin Mba,! under the MUCA-PS sampling distribution TTp^, then the estimator 
for 3?3yi is given by 11501 : 

e^. Etl/(^W,.)P(^W,.k|.s) ^3 

Y!l=l.f{~MBA,i)P{MBAA'^^Ps) 

where: 



t>(M N ±±yiviBAM"PS)'' i^n^ 

P{MBA,i\^Ps) = —b — 7777Z 1 ^ X ^^.,^,r^ . .^ (^■'-' 



H{MBAA^'^s)e'^ ^'^^'^^-^ 
EtiHiMBAA^psy'"'"'-^'^^"^ 
where rjpsiMBA) is the associated MUCA weight function. Like the EP method, the essential feature of the MUCA distri- 
bution P{M baIt^j^s) is i*- contains both P{Mba\'^a) ™d P(Mba Iti"!;)^ so as to account for all the regions of (effective) 
configuration space over which the weights Wn{MBA) (see Ea. l4.48t and WdiMsA) (see Eq. 14.491 1 are significant. 

5.2.4 Numerical results 

Figure 1511 shows an illustration of the application of the MUCA strategy to the EP and the PS methods. Figure (b) shows 
the MUCA-PS distribution which allows switching between the two phases. Figure (a) shows an estimate of the canonical 
distribution P{Mba\''^'ps) as obtained from Eq. 15.71 Since the two peaks in figure l5Jl (a) do not overlap, one may (by 
virtue of Eq. I2.76t think of these peaks as effectively corresponding to (scaled versions of) the phase-constrained distributions 
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P{Mba\'^'a) ^^'^ ^(^^baKb)- From this figure it is clear that the (canonical) probability of the simulation visiting the 
Mb A ^ 0(1) regions is negUgible, and it is for this reason that one must sample from the distribution shown in figure ISTI 
(b) in order for the simulation to be able to switch phases. Figure l5m (c) shows the MUCA distribution (for a simulation 
initiated in phase B) that is required in order to ensure that the EP estimator (Eq. 12.261 1 is free of systematic errors. Figure l5!T] 
(d) shows the convergence of the FED per particle as the MUCA-EP distribution P{MBA\'^ij ) is extended so as to include 
increasing proportions of the distribution associated with phase A . It is clear that convergence is obtained in the limit of the 
MUCA distribution PiMBAlT^s) containing the whole of P{Mba\t^a)- 

The underlying feature of the form of the two MUCA distributions (Figure IsTTl l'b') and (c)) is that they both 'contain' the 
two canonical distributions P{Mba\t^a) ^'(^^saIti'b)- The difference lies in the way in which they achieve this. In the 
case of the EP method, one employs a single sampling distribution tt^. In the case of the PS method, one employs either 
7r™(v, A) or 7r™(v, B), depending on which phase the simulation is in. This difference manifests itself in the range of Mb a 
space over which multicanonicalisation must be performed. Whereas in the case of PS method one explicitly constructs the 
weights (via Eq. \5.3l over the region of M ba space lying between the maxima of the two peaks, in the case of the EP method 
one performs enhancement on the whole region between the maximum of the peak of the parent phase (left hand peak in 
Figure I5T] (a) 1 and the tail (and not merely the peak) of the conjugate distribution (right hand peak of Figure l5?T] (a)). 

The reason for this can be understood as follows. When the PS method 'switches phases', it switches the sampling 
distributions to that which would naturally lead to the exploration of the conjugate phase, even without the aid of MUCA 
weights. As a consequence the role of multicanonicalisation is merely to ensure that the Mb a ^ 0{1) regions are accessible 
to simulations initiated in either of the phases. This entails the peak-to-peak reweighting, which is evident in the MUCA 
sampling distribution shown in figure lOI (b). In the case of the EP method, the canonical sampling distribution (which 
in our case is tt^ and is associated with the Mba < regions) is fixed and is ill-suited to sampling of the regions of 
(effective) configuration space associated with the conjugate phase (which in our case is phase A and corresponds to the 
Mba > regions). As a consequence the MUCA weights must not only take the simulation to the Mba ^ 0(1) regions, 
but must also force the simulation to visit the entire region of (effective) configuration space relevant to the conjugate phase 
{AI BA > regions), since the sampling distribution tt^ will typically try to direct the simulation back in the direction of AIba 
space associated with the parent phase (Mba < regions). It is for this reason that the EP method requires the additional 
construction of multicanonical weights (Eq. \5.3t over the regions spanning from the maximum of the distribution of the 
conjugate phase to its tail (compare figure Ism fb) with (c)). 

This difference manifests itself in the MUCA weights. Figure]^] shows a comparison of the MUCA weights for the two 
methods. It is clear that for the AIba < regions, the MUCA weights for the EP and PS methods are the same; the reason 
for this is that the canonical distribution associated with these regions is tt^ for both methods. This property holds until the 
AI BA ^ regions. For the Mba > regions, the profiles of the two weight functions diverge. In the case of the EP method 
the weights decrease as AIba increases, whereas the weights of the PS method increase before levelling off. The reason for 
this is due to the switching of the phases that takes place in the PS method. That is for the AIba > regions, the probability 
of a switch of phases being accepted will be unity. On switching phases the PS simulation will naturally explore the AI ba > 
regions, even without the aid of MUCA weights. The presence of weights in the AIba > region is merely to guarantee that 
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Figure 5.1 : MUCA Strategies 

a) The canonical probability distribution P{MBA\'n''ps) for the PS method 

b) The MUCA probability distribution P{MBA\Trps) for the PS method 

c) The full MUCA probability distribution for the EP method initiated in phase B (left hand peak in (a)) 1 151 1. The MUCA 
distribution has been constructed in a way which ensures that a simulation initiated in phase B is able to visit all the regions 
of (effective) configuration space associated with phase A (right peak in (a)). 

d) The convergence of the FED per particle for a series of MUCA simulations (initiated in phase B) as an increasing proportion 
of the conjugate distribution (right hand peak in (a)) is included in the MUCA probability distribution. The horizontal axis 
measures, as a fraction of the distance from the maximum of the left hand peak in (a) to the tail of the right hand peak in (a), 
the distance up to which the weights satisfy the relation in Eq. 15.31 

T* = 1.0, RSM. 
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Figure 5.2: Comparison of tine EP and PS - MUCA weiglnts 
The MUCA weights as a function of Mb a for both the MUCA-PS method and for a MUCA-EP simulation initiated in phase 
B. The inset compares the actual MUCA weight function for the PS method ?7ps(Mb^) with riB{MBA) + + e^^^-^J. The 
agreement between the two is in accordance with Eq. 15.111 
T* ^ 1.0, RSM. 



the simulation is, once it has jumped from phase B to phase A, able to come back at a later time from the Mb a > regions 
to the Mba ^ 0{1) regions, so as to allow the simulation to switch back to phase B, which will then allow it to naturally 
explore the Mba < regions once again. On the other hand in the EP method the simulation will have to be forced to visit 
the Mba > regions of (effective) configuration space; figure l?!2l clearlv illustrates this. 

On the transition to larger system sizes, the differences seen in the MUCA distributions of the EP and PS methods become 
less noticeable. The reason for this lies in the ways the means and the spreads of the peaks scale with the system size. Since 
for each peak the mean will scale as N, whereas the standard deviation (which measures the spread) scales as \/iV, we see that 
the additional amount of Mba space which will require reweighting in the case of the EP method, over that of the PS method, 
will become smaller as a fraction of the peak-to-peak distance (which scales as N), on the transition to larger system sizes. 

For finite systems there will also be a difference between the MUCA-EP distributions associated with a simulation con- 
strained to phase A as compared with one constrained to phase B. In the case of a MUCA-EP simulation constrained to phase 
A, the MUCA reweighting will now need to be performed from the maximum of the right hand peak in figure I5m (a) (which 
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is now the peak corresponding to the parent phase) to the tail of the left hand peak (which now corresponds to the conjugate 
phase). This asymmetry that appears, due to the peak to tail reweighting, will disappears on the transitions to larger system 
sizes, for the same reasons cited above. 

The intrinsic similarity of the MUCA distributions indicates a connection between the MUCA weights of the different 
methods. For example by appeal to the overlap identity (Eq. 12. 2U we see that: 

3?23yi/'(AfBAki?)e"«(*^«^) cx e-*^«-^P(MBAk:?)e''^(*'^«-^) (5.8) 

Assuming that the two MUCA distributions are flat, one may infer that for the regions over which the two distributions overlap 
fT52l : 

Vb^Mba) = -Mb A + Va{Mba) + constant (5.9) 

Similarly the fact that the MUCA-EP and the MUCA-PS distributions are approximately the same means that one may arrive 
at a similar identity relating the MUCA weights of the two methods. To arrive at the result we substitute Eq. 14.371 into Eq. 
I4.45l (in which the weights w-y have been set to be equal) so as to obtain the following relation: 

^^^^^^'^^^Z;T1; = ^ + ^~M,^ P(MBA\nj.s) (5.10) 

Using Eq. l5.2l one obtains: 

, j VA{MBA) + Hl + e-''^--]+L, 
riPs{MBA) = < (5.11) 
[ VB{MBA) + Hl + e^'^^]+L2 

where Li and L2 are some constants. This identity also naturally follows from Eq. 12.731 Since these constants do not affect 

the simulation in any way (since it is only the relative values of these constants that matter) we may, without loss of generality, 

set these two constants to zero. The inset in figure IS^ shows a plot of rjps[MBA) and a plot of risiMBA) + + e^^^^], 

where both rips{MBA) and rjB{MBA) have been estimated via simulation. The clear agreement between the two curves 

verifies that the relation in Eq. l5.11l does indeed hold. 

So far we have noted that the EP and PS method are different in two respects. Firstly they require different ranges of 

Mb A space to be reweighted, and secondly the MUCA weights are different. However these differences mask the underlying 

similarity of the two methods. The first difference, that is the difference in the range of Mb a space which requires reweighting, 

vanishes on the transition to sufficiently large system sizes. The second difference merely arises from the fact that the canonical 

samphng distributions are different. However since the MUCA distributions are the same, and since the weight of macrostates 

are proportional (compare Eq. 14. 141 to Eq. 14.481 and Eq. l4.15l to Eq. I4.49> . we see that the methods are essentially identical. 

This equivalence between the two methods may be most readily expressed through the corresponding estimators: 



CHAPTER 5. SAMPLING STRATEGIES 



115 



Y!l=lf{MBA,i)H{MBAA^PSy'""^''''^"^ 
Etji~MBA,:)H{MBAA^fs)^'"'''^^^^"^ 

EtJi~MBA,^)H{MBAA^Ps)e''^^^^^-Hl + e-*^«^.'] 



(5.12) 



where we have used the fact that the MUCA distributions P(AfByi [tt^) and P{M ba^ps) ^re the same, so that H{M ba,^^) oc 
H{MBA,i\'^f}s)- ^1- 15 .121 establishes the equivalence between the PS estimator (Eq. 15. 6> to that of the EP estimator (Eq. 
l238t in the MUCA limit. 

In the absence of MUCA weights, the two estimators are no longer equivalent. The reason for this is because now the 
Mb A distributions are no longer the same, even though the weights remain proportional to each other As a result the statistical 
errors will be different for the two methods in a finite run simulation. 



In the previous section we have seen that the MUCA strategy provides an efficient framework (in the case of zero equiUbration) 
for tackling of the overlap problem. The basic idea is to construct a sampling distribution which contains the two phase- 
constrained distributions P{Mba\t^a) PiMBAlT^s) ^How for the construction of a path (in a piecewise but serial 
manner) from the region of (effective) configuration space associated with phase A to that of phase B. This allows one to 
determine the weight of the typical macrostates associated with phase B relative to those of phase A. The important thing 
to notice is that this path can also be constructed in a parallel manner (that is piecewise and independent fashion). This is 
effectively what the MS and WHAM methods do 11531 and was originally proposed by Geyer I154I - I157I . The essential 
ingredient of all these methods is that the independent simulations overlap in some region of the (effective) configuration 
space that they explore. It is only when they overlap that the data of a simulation obtained with tt, say, may be reweighted 
with respect to tt, which overlaps with tt, so as to yield a set of macrostates whose relative probabilities are in agreement with 
TT and which, at the same time, extend outside the range normally explored by tt (see Eq. I1.32t . In this way one may use 
the idea of reweighting to estimate the probabilities of the regions of (effective) configuration space typically associated with 
phase B in relation to those of phase A. 

With this in mind let us proceed to construct a new way of estimating the FED based on the idea of simulating several 
independent systems. Consider the construction of a chain of configurational energies, as has been done in Eq. 12.441 whose 
associated sampling distributions (Eq. 12.451 1 overlap in a manner so as to yield a path connecting the two regions of (effective) 
configuration space associated with the two phases. Then instead of writing the ratio of the partition functions as has been 
done in Eq. 12.461 one may instead choose to write it as: 



5.3 The Multihamiltonian strategy 



5.3.1 Theory 
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3? 



■/9£a„(v) 



f rr"-ldv e-''{^^2(^l) + £A3(v2) + + £a„(v„_i)} 

•' (5.13) 



or: 



where V = {vi, V2, v„_i} denotes the collection of the configurations of the n-1 independent replicas, and where: 



0<,iV) = \ ^ if^ = ^ (5.15) 

I E;Ci£a,,,(v,) : if7 = B 

By writing JI-ba as has been done in Eq. 15.141 a new strategy immediately becomes apparent. That is rather than simulating 

the actual systems with the configurational energies £^ and £ b (see Ea. l2.13> one may instead simulate the composite systems 

described by the extended configurational energies and Jis- If one now generahses the PM operation from the original 

version: 

A'^B v^v (5.16) 
to that of a PM between the composite systems in which the extended configuration V is matched for the two systems: 



A^B V^V (5.17) 

then it is clear that the array of estimators and techniques used to estimate Eq. I2.13l mav also be used here. The key idea is that 
the independent sampling distribution {ttauTTAj, ...,7r\^} provides the necessary extended sampling strategy that is needed 
in order to overcome the overlap problem. The greater the number of configurational energies in the chain, the greater is the 
weight of the set of configurational energies {£a2 j , £a„_i } that the two hamiltonians and 1Kb share, and therefore 
the greater is the overlap between the effective configuration space of the two composite systems. 
In order to be able to quantify the overlap, it is useful to once again define a macro variable: 

Mba(V) = mB(V) - JCa(V)] (5.18) 

Mb A essentially corresponds to the (temperature scaled) work (which, as before, we will subsequently refer to as work) in- 
curred in switching (in an instantaneous fashion) between the extended configurational energies 3-Ca and b whilst preserving 
the extended configuration V. In the case of the linear parameterisation given in Eq. 12.481 Eq. I5.18l mav be written as: 



ri-l 

Mba{V) = ^AA,Mba(v.) 



(5.19) 
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In order to quantify the overlap in a meaningful way one must be able to relate the probabilities of a macrostate Mba 
associated with one system relative to that of the other. Suppose that: 

~c^g-/3M-,(v) (520) 

denotes the sampling distribution of the composite system (which may be realised by independently simulating the n-1 sam- 
pling distributions {ttx.}). The procedure of sampling via tt^ will be referred to as the multihamiltonian (MH) method. 
Suppose that P{MBA\T!'!y) denotes the probability of obtaining the Mba when sampling with tt!^ . It immediately follows 
from the form of Eq. 15. 141 (compare this to Eq. I2.13> that the probability of obtaining Mba (as defined in Eq. I5.18> when 
sampling with tt^ relative to that when sampling with respect to tt^, is simply given by the overlap identity (Eq. I2.1Q1> . To 
see this more explicitly we observe that: 



n-1 



P{Mba\^a) - f S{MBA{V)-MBA)e~^^"-^'^'^'^''^^Y[d^, 

SiMBAiV) " MBA)e-~^^^='' X 



nf 

^=l 



n-1 



n—1 ^ „ n—1 

/ S{MBA{V)-MBA)e-^'^'-^''^^^^^'''^e''--^^^l[d^. 

-^^1 r=2^>'i i=l 



= eMi.^|£p(MBA|^^B) (5.21) 

^A 

which is the overlap identity given in Eq. 12.1011 Once again the distribution of P{Mba\t^'^) will look something similar to 
that shown in figure l4~2l with the point of intersection being located at AIba = Af,,, . We note that this also follows from the 
fact that the MH method can be viewed as a limiting case of the EG method (see appendixO. 

The crucial point to reahse is that as the number of configurational energies in Eq. I5.15l increases. the overlap O increases 
and tends to unity. We recall that heuristically this may be understood by noting that the greater the number of replicas, the 
greater is the weight of the set of configurational energies {£\2 ' £^3, £a„_i } that the two hamiltonians 3-Ca and 3is share 
(see also fi sure ISJl for an alternative explanation). In the case of the n=2 this set has zero weight. As n ^ oo, the weight 
of this set dominates over the configurational energies £ai and £a„ which are at the edge of the chain of the configurational 
energies in Eq. I2.44l and which describe the two phases whose FED one is trying to measure. It is these edge configurational 
energies that give rise to the difference between and Kb (see Eq. I5.15> . In the limit of the number of configurational 
energies in the chain tending to infinity, one may write: 
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< Mba(V) = Yj< ~ £a,(v,) >^c 



n-1 



1=1 
A=A„ 



A=Ai 



[£a,+i( v») - £a. (v,)] 
AA„. 



dA < >n (5.22) 
oA ^ 



< Mb Am >*^« / d\<—^ (5.23) 



Similarly, it follows that: 



Therefore in the limit of the number of configurational energies in Ea. l5.15l tending to infinity, we find that < Mb^(V) >#c 
approaches the value obtained by thermodynamic integration: 

lim < Mba(V) >sc^ / dX<—^ >,c= - (5.24) 
It follows that in this limit Mb a must have the distribution corresponding to perfect overlap: 

P{MbaK) = 5{Mba + In3l23yi) (5.25) 

Comparing Eq. 15.141 to Eq. 12. 131 it is immediately apparent that the array of estimators as parameterised by Eq. 12.341 are 
available for the task of estimating FEDs. More generally one could also perform an arbitrary switching FG process in which 
!H-y is gradually switched into so as to allow an estimate of the FED to be obtained either from the phase-constrained 
methods Eq. \2.101\ Eq. [4.20\ or from the PS formulae (Eq. [4.471 . For example in the case of zero-equilibration the MH 
version of the PS sampling distribution is given by: 

g-/3M^(V) 

^P5(V,7) = ^- (5.26) 

WA^A + Wb^B 



where is the weight which biases phase 7, and 7 is a stochastically sampled variable (see section l43> . 'R'ba may then be 
estimated either via Eq. l4.43l or Eq. I4.47l ('where Wba= M ba)- 

Inspection of the MH sampling distribution (Eq. 15.2m may lead one to believe that the MH method is equivalent to the 
MS method. This is generally true if one can write Eq. I2.34l as: 

■J<-'ZA = — 



< G(/3[£b - £a]) 

nr=i'<G(/3{£A.+.-£A. 

which only holds if G(x) is of the exponential form: 

Gix) = e"^ (5.28) 
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where a is some constant. Therefore the only choice of G{Wba) for which the MS and MH methods are equivalent is the EP 
(Eq. I2.26> method. In this case the only difference that arises for the two methods is that in the case of MH estimator one 
only deals with a single estimate of the error, where are in the case of the MS method one must combine n-1 such errors in 
determining the error of the final value for the FED between the two phases. For other estimators (including the PS estimator) 
the two strategies are not equivalent. 

We finally note that the potential for parallelising the sampling of Eq. 15.201 is a clear advantage of the MH method. 
Furthermore, in comparison to the WHAM method, the storage requirements are far less. If one records the value of AIba 
during the course of the simulation (as one would do if one wanted to avoid the systematic errors introduced by employing 
finite bin width histograms) then the use of the MH method will yield significant gains in regards to this issue, since for this 
method one will need to record only a single temporal sequence of M^^'s, as opposed to recording one such sequence for 
each and every replica. 

5.3.2 Numerical Results 

In this section we illustrate the application of the MH method to the systems investigated in this thesis. Fi sure l5Jl shows the 
distributions of the macrovariable Mb a for the replicas of the chain of configurational energies (Eq. 12.441 Eq. I5.15t in which 
n=7 and in which the configurational energies are linearly parameterised as prescribed in Eq. 12.481 The crucial feature of this 
figure is the way in which the distributions overlap, so as to provide a continuous path (in Mb a space) from the region of 
(effective) configuration space associated with phase A (right hand most peak in figure l5Jt to that of phase B (left hand most 
peak). Figure 15341 shows the corresponding MH-PS distribution which employs these replicas in the sampling distribution 
Eq. 15.261 and shows how the MH-PS method is able to effectively overcome the overlap problem (notice that the overlap 
between the phase A and phase B, the right and left hand most peaks respectively, do not overlap at all). Fi gure l53l shows the 
probability distributions of Mb a for the MH-PS method (Eq. I5.26> for different n. It is evident that, as the number of replicas 
n increases, the overlap increases, with the distribution P{Mba\t''ps) tending to the ideal limit of a delta function centred on 
Mm, which follows from Ea. lZ76l andEa. l5^ 

5.4 The Fast Growth strategy 

An alternative to the MUCA and the MH strategies is the EG method (see section (2.4. 8> . Under this scheme, one performs 
work on the system so as to morph the configurational energy from that of phase 7 (£^) to that of phase 7 (£7). In the 
process a path is constructed linking the set of macrostates associated with phase A to those associated with phase B. The 
key parameters in the method are the increments {AA^} of the field parameter A and the equilibration times {Ati} (where 
Ati = ij+i —<,:). For simplicity, we will limit ourselves to the case where all the increments are equal , that is AA^ = A A, and 
when all the equilibration times are equal, that is Ati = At. It is only if the equilibration times At are sufficiently long and 
the perturbations to the configurational energy (AA) are sufficiently small that one is able to construct a path (an overlapping 
sequence of macrostates) connecting the two phases. 

One may immediately identify two limiting cases. In the case where At = 0, one obtains the zero equilibration methods. 
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Figure 5.3: Probability distribution functions P{MBA\T^\i) 
The figure shows the probability distributions P{MBA\'^i) (where tt^ = Tr\., see Eq. I2.45t for the replicas constituting 
the chain of configurational energies (Eq. 12.441 Eq. I5.15t linking the two phases. In this figure we have chosen the linear 
parameterisation (Eq. I2.48> and set n=7. By constructing a series of independent and overlapping distributions, so as to bridge 
the regions of (effective) configuration space associated with the two phases, one overcomes the overlap problem. More 
specifically, since for some of these distributions Mb a > and for others Mb a < 0, one obtains cancellations in the overall 
summation given in Eq. 15.191 As a consequence the distribution of A/sa(V), as given by Eq. 15.191 will reside between the 
peaks associated with the two phases, resulting in improved overlap between P(MByi |7r^) and P{Mba\''^b)' compared to 
that between PiMBAl^A) and P{Mba\tt%)- 

See figure lS^l fa) for the PS distribution for the corresponding composite system described by Eq. 15.151 
T* = 1.0, RSM. 
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Figure 5.4: P{Mba\ttps) (MH-PS method) corresponding to fiaure l5.3l 
The figure shows the probability distribution P{Mba\t^ps) obtained via the MH-PS method in which the distributions of 
the composite replicas {tt\^} is given in figure l5Jl 
n=7, A A = 1/6, T* = 1.0, RSM. 
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Figure 5.5: MH-PS Probability distribution functions of Mba for different n 
This graph shows the probabihty distribution P{Mba \t^ps) foi" '^he MH-PS for n= 4, 10 and 50. The corresponding probability 
distribution for the original PS simulation (n=2) is shown for comparison. As the number of replicas n is decreased, the 
probability distribution of Mba tends to that of the original (n=2) PS simulation. However as the number of replicas is 
increased, the two peaks associated with vr^ and tt? increasingly overlap, thereby increasing the chances of a switch being 
accepted (see Eq. 12.871 and notice that Mm ^ for the systems investigated here). 

It is clear from the figure that the chance of a switch taking place is negligible for the original PS method unless one employs 
some form of extended sampling. The MH method offers an alternative extended sampling strategy to that of the MUCA 
method. The advantage that the method has is that it is highly parallelizable. On the down side, it makes an increasing 
demand on the memory requirements since one will have to store n-1 replicas of the given system in the computer memory. 
We note that it was found that the errors in the estimate of the FED obtained in a finite run were independent of the number of 
replicas chosen. 
T* = 1.0, RSM 
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irrespective of A A, as one does in the case where AA = 1 f55V In the limit where AA ^ 0, At > (which we will refer to 
as adiabatic equilibration since such a method takes an infinite time to switch from the configurational energy of phase A to 
that of phase B), one obtains the thermodynamic integration method (Eg. 12.521 f55l: 

lim < e'^-- >,c = 6-/3 'iA<^>,, (5 29) 

AA->0,At>0 ^ 

Both limits have undesirable features as they stand. On the one hand the choice of zero equilibration induces systematic errors 
(as described in section l2'.4.2t which must be overcome by appeal to some form of extended sampling strategy ( 1211 . section 
15.21 section l5Jl . On the other hand the choice of adiabatic equihbration is time consuming. Furthermore it is not clear how 
one accounts for the systematic errors induced in making an approximate evaluation of the integral in Eq. 15.291 Generally 
some intermediate strategy is preferable in which Eq. 14.31 Eq. I4.47l is used to estimate ^-ba- 

We will now focus our attention on such intermediate strategies. Namely we will investigate the way in which the FG 
method overcomes the overlap problem for these intermediate strategies through control of the parameters A A and At 11581 . 
In particular we will examine three variations. In the first case we will keep AA constant and vary At. In the second we keep 
At constant and vary AA. In the third we investigate the case when At/ AA is held constant. Since the total time allocated to 
obtaining each work term Wba is given by {n — 1) At = At/AA, we see that the last case corresponds to the variation of the 
parameters AA and At so as to ensure that the amount of time allocated to performing work on the system is held constant. 
In all cases we will demonstrate the improvement in the overlap by demonstrating the convergence of the FED estimate as 
obtained by the EP estimator (Eq. 12.1031 1 relative to that obtained via the MUCA-PS method. Again we illustrate the overlap 
by examining only one of the phase constrained distributions, which in this case is P{Wba\t^a)^ since its conjugate partner 
^'IW^baKb) is roughly symmetrically positioned about the origin (which is where Wm, see figure l4~2l roughly lies). An 
example of this approximate symmetry is illustrated in figure ISTSl The particular parameterisation of {£.\. } that we employ is 
the linear parameterisation given by Eq. 12.481 

5.4.1 Keeping AA constant, varying At 

Figure (a) shows the probability distribution of Wba as the increment A A is kept fixed but the equilibration time At is 
varied, and figure l5^ (b) shows the estimate of JIba (the normalised value of ^-ba with respect to the corresponding value 
as obtained by the MUCA-PS simulation) as a function of At. 

Figure l5^ (a) clearly shows that as the equilibration time At is increased, the mean of the distribution P{Wba\t^a) 
its associated variance both decrease. To understand this we first note that each time a work increment SWBA,i is performed, 
a lag develops in the ensemble of configurations immediately associated with the system after this operation. Namely, 

when the switch from a configurational energy Ex. (v(i)) to Sa^+i (v(«)) is made, the configuration v{i) will not be typical 
of the set of configurations associated with ttx^^-^. Furthermore this lag accumulates as one performs the FG process. A 
consequence of this is that the distributions of the energies (v(i))} associated with the system immediately after its 

configurational energy has been incremented from to £Ai+i will not be the same as the equilibration distribution which 
we denote by {£Ai+i (^(i))}^. Typically the values within the set {£Ai+i will be higher than those of {£Ai+i 

Increasing the equilibration time At decreases this lag, and this is precisely what is observed in Figure IS/Tl Ca^. 
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Figure 5.6: Symmetry of P{Wba\tt%) and P{Wba\7Tb) 
The probability distribution of Wba for the A B process (in b), in which A is increased from to 1, and for a B ^ A 
process (in a), in which A is decreased from 1 to 0. 

The equilibration time is expressed as a multiple of lattice sweeps. In other words, a single lattice sweep, in which one 
attempts to sequentially perturb all N particles, corresponds to an equilibration time of At = 1. 
At= 1,T*=1.0, n=ll,RSM. 
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Figure 5.7: Convergence of ^ba with increasing At , AA= constant 
Figure (a) shows the distribution -P(Wsyi Itt^) as the equihbration times At is increased whilst AA is maintained at a constant 
value. 

Figure (b) shows the convergence of JIba as At is increased. 

JIba = '^ba/^*bA' where '31% is the value obtained by the MUCA-PS method and where JI-ba is estimated by the EP 
estimator (Eq. I2.103> . 
T* = 1.0, RSM. 




Figure 5.8: Convergence of ^Iba as the equilibration time At is kept constant but as AA is varied 
Figure lsTSl fa) shows the variation of P{Wba\t^a) as the equilibration time At is kept fixed, but as the increment AA is varied. 
Figure lsTsl^ b) shows the associated convergence of JIba- 
See figure lS^ for the definition of ^ba- 
T* = 1.0, RSM. 



Further insight into the workings of the FG method may be obtained by noticing that the components SWBA,i = A\MBA,i 
of the overall work term can be both positive and negative. Suppose that one performs a FG simulation in which the 

canonical distribution of the 'intermediate' stages are given by those shown in figure 1531 For zero equilibration time (Ai= 
0) 5WBA,i = AAAfB^(v(l)) for all i, where Mba{'^{^)) is the starting value of Mba, namely MBAi^i^))- Since this 
corresponds to a value of Mba chosen from P{Mba\t^a) ('^^e right hand peak of l5.3t we see that SWbaa will be almost 
always positive. Suppose that P{MBA\ti) denotes the probability distribution of Mba at time ti, when the configurational 
energy has been incremented from £a,_i to E\. and after the system has been equilibrated with tt^ for a time At. Then 
as the equilibration time At increases, the distributions P{MBA\ti) will shift from the right hand peak in figure l531 towards 
the left. In the limiting case of At ^ oo one will find that P{MBA\ti) ~* P{MBA\T^\i), so as to yield the collection 
of distributions in figure 1531 111 59i . Therefore we see that the increase of the equilibration time At will eventually lead to 
significant cancellations between terms in Eq. 12.941 resulting in a decrease (on average) of Wba from the value it assumes in 
the case of zero equilibration. As a result one obtains improved overlap between P{Wba\t^a) ^^'^ ^(W^saK^), resulting in 
the convergence of ^JI-ba in the limit of large At, as can be seen from figure l5^ b) (note that the convergence is conditional 
on A A being small enough). 
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5.4.2 Keeping At constant, varying AA 

Figure ISTSl fb) illustrates that as the equilibration time At is kept constant, whilst A A is decreased, 3^syi converges as the 
overlap between P{Wba\''^'a) ^i^^ P{Wba\''^b) increases. In order to understand this consider the following argument. 
As AA decreases, less equilibration time is needed between successive work increments, until eventually Ai matches the 
equilibration time 'needed' in order for the lag to be absent so that P{MBA\ti) = P{MBA\T^\i)- In this case the distribution 
of Mba at each timeslice will be something reminiscent of what is shown in figure l53l Subsequent decrease of A A will 
just correspond to increasing the number of configurational energies in the chain of Eq. 12.441 which will lead to 'better' 
cancellations in Eq. 12.941 thus taking P[Wba\''^b) closer to the ideal limit in Eq. 15.251 Eventually the overlap between the 
two phase-constrained distributions will be sufficiently great so as to ensure the convergence of '51%^ even when estimated via 
the EP method (Eq. I2.103> . as is clearly verified in figure ISTSl 

Let us now analyse the behaviour of the statistical and systematic errors for the FG-EP estimator in the context of the 
systems that we have studied. Insight into the interplay between statistical and systematic errors may be obtained from figure 
15.81 (b). It is clear from this figure that whereas for large AA systematic errors are present (since 'JIba ~ 0), for small AA 
they are absent (since 'Rba ~ !)■ In between these two limits, one finds that as AA decreases, the systematic errors decrease. 
The behaviour of the statistical errors, on the other hand, is quite different. In this case the statistical errors are small in both 
the large AA and small AA limits, and in between these limits there is a transient regime where the statistical errors greatly 
increase. This is merely an artifact of the EP estimator, and may be understood as follows. 

In the case of negligible overlap the weights (see Eq. I4.14> of the macrostates actually sampled are small in 

value in comparison to the macrostates which contribute most significantly to the numerator of Eq. 14.131 As a consequence 
the variance of the estimate of 'R'ra. (see Eq. I4.13> will be small, since the estimate of iK^yi will itself be small. As the overlap 
increases, one eventually enters aregime where the main body of P{Wba\t^%) resides within the tail of P{Wba\''^'^a^- In this 
case, the macrostates which contribute significantly to the numerator of Eq. 14. 131 will originate from the tail of P{Wba\''^'x), 
and as a consequence their statistics will be bad, resulting in large statistical errors in the FED estimate, despite the absence 
of systematic errors. As the overlap improves more of P{Wba\''^'b) gets contained in the main body of P{Wba\t^'a)^ ^nd as 
a result the statistical error in the estimate of the FED improves. 

5.4.3 Keeping ^ constant 

We also considered the convergence of "JIba as AA is varied, given the constraint that the total time spent obtaining each work 
term Wba(*) is kept constant. Since corresponds to the number of times the configurational energy £a is 'perturbed' the 
constraint of keeping the time (tw) spent obtaining each work term WBA{i) constant corresponds to: 

At 

^ - tiy (5.30) 

It is clear that if AA is too large, then one will approach the limit of zero equilibration, resulting in the appearance of the 
systematic errors described in section l2'.4.2l This is exactly what is observed in figure ls!9l since as AA 1 'JIba ^ 0. As 
A A is decreased these systematic errors vanish, resulting in "JI-za converging to unity II 160I . 
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Figure 5.9: Convergence of JIba as a function of AA given ^=constant 

At/ AA corresponds to the time spent obtaining each work term Wba- Therefore the parameters At and AA are varied so as 
to keep the total time expended on obtaining each work term Wba constant. 
See figure lsTTl for the definition ofJlBA- 
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The behaviour exhibited in figure ls!9l may be understood as follows. After each work increment a lag develops. This 
lag can be overcome by appropriately equilibrating the system. However the lag obtained from going from A„_i to A„ (as 
measured by the deviations of {£a„_i (v(ri — 1))} from {£a„_i — 1))}^) can only be removed by performing equili- 
bration after the Wba measurement has been made. Therefore this lag persists in the value of Wba that one obtains. As a 
consequence if AA is too large, will become large, leading to less overlap and eventually to systematic errors. 

We finish off this section by noting that it was observed from the numerical data that the errors seemed to be essentially 
independent of AA, provided AA was sufficiently small. In other words our numerical work seems to indicate that the error 
in ones estimate of "JI-ba (based on the estimator of the EP method, Eq. 12.1031 1 essentially depends on the total time allocated 
to obtaining each estimate of Wba (provided A A is sufficiently small), and not on A A. 

5.4.4 The choice of estimator 

Our investigations into the way in which the EG method deals with the overlap problem have primarily focused on the EP 
estimator (Eq. I2.103> . As we have noted in chapter@]this estimator will require a significantly greater amount of overlap 
between the phase-constrained distributions than will be the case of the dual-phase (DP) estimators (Eq. 14.31 Eq. I4.20> or the 
PS estimator (Eq. I4.47> which only require partial overlap between the two distributions. Since the application of these 
estimators is straightforward, we will only illustrate the use of the PS estimator, which has not been formulated before within 
the context of EG. 

Figure lSTOl illustrates the application of the FG-PS estimator, as described in section l2'.4.9l to the systems studied here. In 
this experiment the equilibration time At has been kept constant whilst AA=l/(n— 1) has been gradually decreased. As we 
saw earlier in section l5'.4.1l that this leads to increased overlap between P{Wba\t^a) and P{Wba\t^b)- This is also clearly 
evident in figure ISTOI since the two peaks begin to merge into one as A A decreases, and is reminiscent of what is observed in 
figure l531 Until now all EED calculations have been limited to the EP estimator and figure l5?T0l (in which the overlap problem 
has been cured) shows the scope for improvement in using more 'intelligent' estimators. 

5.5 Conclusion 

Given a choice of representation one obtains two phase constrained distributions P{Wba\t^a) P{Wba\t^b)- ^ finite 
run simulation their estimators P(W^_BA|7r^) and P{Wba\t^b) o^" ^'^^ overlap. If they overlap then one may choose 
an estimator (Eq. 14.31 Eq. 14.201 or Eq. I4.47> which yields an estimate of the FED which is free of systematic errors. In the 
absence of overlap, one must engineer overlap via one of three possible strategies: 

1. MUCA method: In this approach MUCA weights are employed to force the simulation to visit the regions of (effective) 
configuration space which it would not visit under the canonical sampling distribution and which at the same time 
contribute non-negligibly to the estimator of the FED. In this way a path is constructed from one region of (effective) 
configuration space to the other. 
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Figure 5.10: The PS method as incorporated for the FG method 
In this figure, we illustrate use of the PS method in conjunction with the FG method rsection l2.4.9l Eq. I2.113> . The probability 
distributions were obtained by keeping At constant and decreasing AA. It is clear as AA decreases, the overlap increases, so 
as to result in the convergence of the two peaks. 
T* = 1.0, RSM. 
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2. MH method: In this method one employs a series of independent simulations which overlap in the regions of (effective) 
configuration space that they explore, so as to provide a path linking the two phases. Specifically, one engineers overlap 
by increasing n, the number of configurational energies comprising the composite systems. The benefit of this approach 
is that it is highly parallelizable, albeit at the expense of additional overheads in terms of memory requirements. 

3 . FG method: In this approach the path is constructed by performing non-equilibrium work on the system so as to take it 
from the regions of (effective) configuration space associated with one phase to those of the other. One engineers overlap 
by making the work increments AA sufficiently small and by adequately equiUbrating the system between successive 
work increments. 

These extended sampling strategies may be combined in a straightforward way. For example, the MH method may be incorpo- 
rated into the framework of the FG method simply by performing work on the hamiltonians "Ka and "Kb- Other combinations 
(such as MUCA and MH) are also possible. In deciding what combinations to use, it is important to bear in mind that both the 
MUCA and FG are serial strategies, whereas the MH strategy is a parallel strategy. Since speedup offered by the MH strategy 
comes at the expense of additional memory requirements, combinations of this method with either the MUCA or FG methods 
may be an attractive option. 



Chapter 6 



Quantum Free Energy Differences 

6.1 Introduction 

Our focus until now has been limited to the classical regime of the phase diagram, and in this chapter we will concentrate our 
attention on the task of estimating the FEDs within the quantum regimes. Specifically we will use the Path Integral Formalism 
of statistical mechanics 0, II161I - II170I to map the problem onto that of determining the ratio of two multidimensional 
integrals of the form used in Eq. 12. 131 (see also 1381 . 11611 . I171I - I179I I This will make available to us the spectrum of 
methods discussed in the previous chapters. For a more rigorous and in depth presentation of the following material, we refer 
the reader to 0, (Ha, ||T68l, and itTSOl . 

6.2 Path integral formulation of statistical mechanics 
6.2.1 Quantum statistical mechanics 

We recap that in the canonical case, classical statistical mechanics 0-1161 begins with the construct of a system of N particles 
and of volume V which is weakly coupled to a macroscopic reservoir. Suppose that denotes a state of the system (which we 
call a microstate of the system) and that a"^ denotes a state (or microstate) of the reservoir Also let us suppose for the moment 
that the set of states {a^ , a^} is finite 11811 . The core assumption of classical statistical mechanics is that if the collection of 
the system and the reservoir is itself considered to be an isolated system of total energy then this collective system is 

equally likely to be in any one of the microstates {a" , cr'^) accessible to it (i.e. of energy _ff when the dynamics of this 
collective system have been averaged out over sufficiently long periods of time. A consequence of this so called 'Equal A 
Priori Probabilities' assumption and the weak coupling assumption is that one finds that the absolute probability of the system 
being in microstate at any given instant of time is given by: 

P{at) = ^e-^"(--^ (6.1) 
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where the partition function Z is given by: 

Z = ^e-'^^f'^' ' (6.2) 

i 

H is the hamiltonian of the system and J^i denotes a summation over all microstates a| that are available to the system 11821 . 
The expectation of a general macro variable (or 'observable', as will be more appropriate in the quantum case) is then given 
by: 

<0>= 1^0(a|)e-'5^«) (6.3) 

i 

In the quantum mechanical case the construct follows a similar procedure (see 13, ||6l). At the heart of this procedure is 
the realisation that because the system is coupled to a macroscopic heat bath, the system will be in one of the eigenstates of 
the hamiltonian operator of the system, and not in a superposition of states 11831 . Furthermore, in the derivation of Eq. 16.11 
the exponential comes directly from the entropic properties of the reservoir (|4l-(6|). Since the reservoir is classical even in 
the quantum formulation of statistical mechanics, we deduce that in the quantum mechanical case the probability of finding 
the system in a microstate erf is once again given by Eq. 16.11 What actually changes is, firstly, what one actually means by 
a microstate and, secondly, the link that one makes between the observable O and the microstate erf. In classical statistical 
mechanics the observable takes a precise value for each microstate, since a microstate essentially corresponds to a fixed spatial 
and momentum configuration of the system. In the quantum mechanical case it will, in the most general case, no longer be 
the case that the observable takes a definite value for each microstate, since now the microstate corresponds to a quantum 
state (or wavefunction). At best one will only be able to specify the quantum mechanical expectation of the observable with 
respect to a given microstate. Therefore in the quantum mechanical case one replaces Eq. I6.3l bv: 

< O 6 >,,,| e-^^('^') (6.4) 

i 

where < O >q^c,' denotes the quantum mechanical expectation of the operator O with respect to the i-th quantum microstate, 
or wavefunction, cr|. By comparison of Eq. I6.3l and Eq. 16. 41 it is clear that whereas in classical statistical mechanics one only 
performs one type of averaging, in the quantum mechanical case one must perform two sorts of averaging. The first average 
is the quantum mechanical expectation (with respect to a microstate) of the operator O, and the second is the averaging of this 
expectation over the quantum microstates accessible to the system. This first averaging, the quantum mechanical expectation, 
is not something which one performs due to our ignorance of the constituent system, but it is something we have to do because 
of the inherent quantumness of systems. 

To formally develop the theory let us begin by denoting the set of eigenstates of the hamiltonian operator H of the system 
(see I184II ) by {\Hi >}. The hermiticity of H will mean that eigenstates of different energy eigenvalues will be orthogonal, 
though states with the same energy eigenvalue are not necessarily orthogonal. One may, however, employ the Gram-Schmidt 
orthogonahsation procedure (see 11851 ') to construct a new set of states corresponding to the degenerate eigenvalue which 
are mutually orthogonal. Therefore there is no loss in generality if we assume {\Hi >} to be a mutually orthonormal set 
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(and therefore correspond to a basis set, see |'1841). It follows from Eq. 16. 41 that the statistical mechanical expectation of an 
observable O may now be written as: 

< O >= ^^e-'3^' < H.|6|i/, > (6.5) 

i 

where Hi denotes the energy eigenvalue associated with the eigenvector \Hi >, ^ ■ denotes a sum over eigenstates, and where 
the partition function Z is once again given by Eq. 16.21 < Hi\0\Hi > denotes the quantum mechanical expectation of the 
observable O for a given eigenstate \Hi > and accounts for the quantum mechanical properties of the system. As is the case 
in the classical formula (Eq. \6.3l the weighted summation J2i^~^^' essentially describes the coupling between the quantum 
system and the classical reservoir. 

Eq. I6.5l mav be written in a more general way as follows; 



<0> - ^Y,<H^\^e-^'"\H,> (6.6) 

i 

= ^^r{Op) (6.7) 

where p, the density matrix, is written as: 



p = e-P" (6.8) 
and where Tr denotes a trace over the matrix elements of the operator Op. The partition function may then be written as: 



Z = Tr(p) (6.9) 

The tracing operation performed in Eq. 16.71 and Eq. 16.91 has only been implemented with respect to the orthonormal set 
corresponding to the energy eigenstates. A-priori these eigenstates, and their associated eigenvalues Hi, are not known. 
Progress is made by noting that the trace is independent of the basis in which it is carried out II184I . and therefore one is free 
to choose any representation. A convenient representation is the position representation, in which case the partition function 
of Ea. l6^ for distinguishable (identical but localised) particles simply becomes: 

Z = J dr< r\p\r > (6.10) 

The position representation is useful for the simple reason that, as we will soon see, it allows one to map the problem of 
determining Eq. 16.21 onto that of determining an integral of the form of Eq. 11.21 This mapping is known as the classical- 
quantum isomorphism, and is what forms the basis of the path integral computational techniques. 

6.2.2 The classical-quantum isomorphism & the path integral 

The partition function, as formulated in Eq. 16.101 is not fully quantum mechanical in that it ignores exchange. Exchange 
(see |6l, 11651 . 11681 ) is a quantum mechanical property that arises out of the indistinguishability of identical particles 11841 . 
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111 861 . In order to incorporate this property into Eq. 16.101 one rewrites it as: 

Z = '^St I dr< r|p|Tr > (6.11) 

where denotes a permutation of the particles, denotes the sum over all such permutations, and 5p denotes the sign of 
the permutation. For bosons 5p — 1 and for fermions 5p assumes the values 1 and -1 depending on the sign of the permutation 
(0, CH). 

The expression in Eq. 16.1 H is still not suitable, as it stands, for use in simulation. What remains to be done is to find a way 
to project the density matrix operator p onto the position representation |r > so as to ensure that one is left with an expression 
involving only real numbers. To do this we first decompose our hamiltonian into the sum of a kinetic part T: 

(6.12) 

and a configurational part: 

E{v)^E{v) (6.13) 

so that: 

H = f + E{r) (6.14) 

where p denotes the momentum operator corresponding to the classical variable p, which represents the collective momenta 
of all the particles, where h denotes Planck's constant, and where m is the mass of the particle. Then the main obstacle to 
expressing Eq. 16.1 ll in terms of real numbers is the fact that T and E{r) are non-commuting operators, which means that there 
exists no basis in which T and E are simultaneously diagonal 11841 . Clearly in order to achieve our goal of recasting Eq. 16.111 
as an expression involving only real numbers, we must find a way of separating out the kinetic and the configurational terms 
in p so that we can separately diagonalise each contribution; E with respect to the position representation and T with respect 
to the momentum representation (a representation which it is diagonal in). 
The fundamental identity which allows this to be done is ( 11187111901 ): 



e--^ = e-^V^^W[l + 0(^)] (6.15) 

where: 

r = | (6.16) 

The Trotter theorem essentially states that in the limit of small r one may approximate the operator e~^^ as the product of a 
'kinetic' operator e~^^ and a 'configurational' operator e""^^. 

We may now use this (Eq. \b.l5i to write the partition function as an integral over real numbers. To do this we use the 
identity 1 = J dr\r >< r| to re-write Eq. l6.11l as: 
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Z = ^5g> f dvi < ri\e-^"e-^"....e-^"\Fri > 

= I dvidv2 < ri|e-^^|r2 X v^le-^" ....e-^"\Pri > 

= I ^'■^ < i'i|e^"''|r2 > < rp|e-"^|Pri > 

r ^ 

= Y.^^ n{<vM-^"\v,+, > (6.17) 



y " i=i 

where rp+i = Tri and represents the collective displacements of all the particles of replica system i. Applying Eq. 16.151 
to Eq. I6.17l we see that: 



lim <r,|e ^-^|r,+i > = < r,|e ^"^e ^^\r,+i > 

P— *-oo 

= e-^^^'-'+i) <r,|e-^*|r,+i > (6.18) 

where E{ri) denotes the total configurational energy of replica i. To recast < ri|e~'^^|r,;+i > we use the identity 11841 . 
(Ml: 



1 = J dp\p >< p| 
where 1 is the identity operator. Substituting Eg. |6. 19l into Ea. l6.18l vields: 



(6.19) 



<ri|e ^ |r,+i > = / dp < r^jp >< p|e ^"2™ jr^+i > 



dpe ^ < r^lp >< p|r,+i > (6.20) 



which, using the identity II 1841 < r|p >= ■^^^7=y^e''''i\ becomes: 



(-^)^ exp{--^(r,+i - r,)2} (6.21) 



and where A, is given by: 
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Eq. I6.21l represents the kinetic component appearing in Eq. 16.181 Collating the results of Eq. 16.171 Eq. 16.181 and Eq. I6.21l we 
finally see that the quantum partition function in Eq. I6.11l mav be written as the following limit: 



or 11921: 



where: 



Z= lim Zp (6.23) 

P^oo 



Zp = Z[l + (6.24) 



Zp = (^^)^e-/3^^V^y / dridr2...drpexp{-/3J{({r})} (6.25) 



and: 



P p 1 
^(i'"}) = EliA^^^'-'+i - ^*)' + (6.26) 

E'^ corresponds to the classical groundstate, £ denotes the total configurational energy minus the groundstate energy £ (r) = 
E{y) — and rp+i = Tri corresponds to a permutation of the particles in replica 1. In the absence of exchange (where 
particles are localised, so as to make them distinguishable) Eq. I6.25l mav be simplified to: 

Zp = (^^)^e-'3^° / dridr2...drpcxp{-/3J{({r})} (6.27) 
47rA„r J 



Eq. I6.27l represents the partition function of a classical system which is isomorphic to the quantum system of interest. This is 
generally what is referred to as the classical-quantum isomorphism and forms the starting point, in one form or the other, for 
the majority of path integral based simulations 11931 . 

6.2.3 Heuristics of tlie polymeric system 



The partition function of Eq. 16.251 and Eq. l6.26l contains all the equihbrium, time independent, information of the quantum 
system, and serves as the starting point for the Path Integral Monte Carlo (PIMC) methods I162I - I169I . 11941 . 11951 . Its 
usefulness lies in the fact that it maps the problem of dealing with an expression involving operators (Eq. I6.9l l onto one 
involving only real numbers. The classical system, represented by Eq. 16.251 and Eq. 16.261 can be thought of as a system of 
interacting polymers (see figure lOt with the following action: 



St = /5W 

= Sx + Sy 



(6.28) 
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where Sk is the kinetic action: 



1 



p 



Sk 



4A,r 



EC 



(6.29) 



and Sv is the configurational action; 



p 



Sv = ^T£(r,+i) 



(6.30) 



Two main parameters control the behaviour of this system of classical polymers. The first is the effective inverse temperature 
r and the second is Xq (Eq. It is the interplay between these two quantities which determines the strengths of the 

harmonic interactions in the kinetic action Sk relative to those originating from the configurational action Sv- 

This system of interacting polymers is unique (as compared to classical polymers) in that beads of a given polymer interact 
only with beads of other polymers which are in the same replica, or timeslice 11961 as we will also call it, labelled by the 
index i, via the configurational energy £ appearing in the configurational action Sv- In addition to this beads of a given 
polymer interact, through the term ^j-^(rj+i — r^)^ in the kinetic action Sk, with the two adjacent beads (of the same 
polymer) belonging to the two neighbouring replicas, resulting in a coupling between consecutive replicas (see figure lOt . 
Sk essentially contains the forces which propagate along a given polymer and Sv contains the forces which give rise to 
interactions between polymers. 

These polymers are also unique in the sense that they have a special boundary condition, namely that rp_|-i = Tri. For 
the case of distinguishable quantum mechanical particles {f = 1), the endpoints of the polymers connect to form loops. 
Distinguishability of the particles then arises from the fact that one may identify each particle with a given loop. In the 
presence of exchange the endpoints of loops coalesce with the starting points of other loops so as to form larger loops, making 
it impossible to distinguish the exchanging particles since now a single loop may represent more than one particle. It is in this 
way that indistinguishability is incorporated into the theoretical framework of the model. 

6.2.4 Temperature regimes in quantum simulations 

In the case of quantum systems one may identify three distinct temperature regimes. In the high temperature limit, the system 
resides within the classical regime where quantum effects may be safely ignored and where the particles are localised to 
regions in the immediate vicinity of their lattice sites. As the temperature is lowered, one first enters the weak quantum 
regime where quantum discreteness effects begin to become important. By quantum discreteness we mean those effects 
arising from the quantisation of energies that accompanies the confinement of particles in their interatomic potential wells. 
A characteristic of this regime is the increased amplitudes of vibrations of the particles about their lattice sites (relative to 
the classical predictions) . This is called the zero point motion and arises from the Heisenberg uncertainty principle. At this 
point the quantum effects are not strong enough to give rise to exchange, and the particles may, therefore, still considered to 
be distinguishable. As the temperature is further reduced one may enter the strong quantum regime, where exchange effects 
can no longer be ignored and where one must explicitly take into account the indistinguishability of the particles. For the 
Lennard- Jones potential one may easily identity these regimes. 
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Figure 6.1 : A schematic of tine classical polymer corresponding to the quantum system 
By using the path integral formalism of quantum statistical mechanics one finds that an interacting quantum system may be 
represented by a classical system of polymers, composed of replicas of the given system (see Eq. I6.25l and Eq. I6.26> . The 
chain linking the same particle in the adjacent replicas may be thought of as the polymer and a particle in a given timeslice is, 
in places, referred to as a bead 11961 . 

Particles within a given replica interact with each other via the classical £ giving rise to the configurational action Sv (see Eq. 
I6.30> . The quantum effects are controlled by harmonic interactions between the replicas (giving rise to the kinetic action Sk, 
Eq. I6.29> . in which a bead of a given replica interacts with beads of the adjacent replicas sharing the same bead index. 
Strictly the classical-quantum isomorphism mentioned above only holds in the limit of an infinite number of replicas. However 
in a Path Integral Monte Carlo / Molecular Dynamics simulation, one approximates the quantum system by a finite replica (P) 
classical polymer Increasing the number of replicas employed makes the approximation more accurate, but at the expense 
of increasing the strength of the harmonic interactions between the replicas. For the simulations, this means an increased 
relaxation time for the polymer and as a result a greater amount of time must be spent (over what one would expect merely by 
accounting for the increase in the numbers of rephcas) performing the simulation in order to obtain the desired estimates. 
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Consider the Lennard-Jones potential given in Eq. 13.21 The parameter e roughly measures the well depth, so that one is in 
the classical regime when the temperature is of the order: 

fcTe~e (6.31) 

In this regime the classical effects mask the quantum effects. 

As the temperature is further reduced, one eventually enters the weak quantum regime in which the typical particle energy 
is less than that of the well depth and is instead of the order of the typical phonon excitation energy: 

kTqw ^ hv (6.32) 

In this case the quantum zero-point motion effects will be important, but at the same time the exchange effects will not show 
up in the system. To determine this temperature we note that: 



v^J^ (6.33) 

TO 



where m is the mass of the particle and fee/ / is the 'force constant', given by: 



keff ^ u {r ^ a) ^ (6.34) 



Substituting Eq. I6.33l and Ea l6.34l into Eq. l6.32l one finds that: 



kT„^,^hJ^ (6.35) 



It then follows that the difference in the orders between and Tc is given by: 



^ - = (6.36) 

where D, the De Boer parameter, is given by: 

D = (6.37) 

TOCfJ^ 

As the temperature is further reduced one may eventually enter a regime where the exchange effects may no longer be ignored. 
This will happen if the de Broglie wavelength becomes of the order of the interparticle spacing a: 

A - a (6.38) 

where A is given by: 

A = - (6.39) 

P 

where p is the momentum. Since there is a minimum non-zero value that the total energy of the system can assume it follows 
that there will be a minimum characteristic value p* that the absolute value of the momentum can assume. Since for a harmonic 
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oscillator the expectation of the kinetic energy is equal to the expectation of the configurational energy, we see that this p* 
may be crudely estimated by setting: 



so that; 




(6.40) 



(6.41) 



Substituting this in Eq. 16.381 




(6.42) 



Rearranging this equation we get: 



D - 1 (6.43) 

Therefore if D is sufficiently large (obtained, for example, by having a particle of small enough mass) then an additional 
temperature scale Tqs will appear at which point exchange between particles may no longer be neglected. These temperature 
regimes are shown schematically in figure lol 

In what follows we address the effects of quantum discreteness but not those of quantum exchange. 

6.2.5 Estimating macrovariables 

In order to extract useful information from PIMC simulations, one must find the estimators for the relevant observables within 
the path integral framework. In the case of estimating thermodynamic quantities, one may derive the estimators merely by 
taking derivatives of the polymer partition function given in Eq. 16.271 For example the mean kinetic energy may be derived 
by using the following relations: 



<T >p = (j m— 

om 



3PN ^ Pm 2 

m— 1 

p-^ < Sk >p (6.44) 



m— 1 

3PN 



2/3 

where the subscript P denotes the fact that the expectation is taken with respect to the distribution associated with a P replica 
polymer (see Eq. I6.27l i. Similarly the mean total energy of the quantum system may be obtained via the relation: 
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Figure 6.2: A schematic sinowing tine different temperature scales tinat exist 
In the weak quantum regime quantum discreteness is important. By quantum discreteness we mean those effects arising from 
the quantisation of energies that accompanies confinement of particles in their interactomic potential wells; a regime in which 
zero point motion is important. In the strong quantum regime 'exchange effects' become important; particles may no longer 
be treated as indistinguishable. As an approximate guide, one may say that if D is small (less than unity) then exchange will 
not take place II197I . resulting in the temperature scales shown in (a). If D is large (greater than unity) then exchange will take 
place and one will have three temperature scales, as shown in (b). 
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<H>p 



1 dZp 

Zp dp 



E"+ <T>p + <£g>p 



(6.45) 



where < £g > is the configurational energy of the quantum system : 



< >^r^ < Sv >p 



(6.46) 



It must be noted that the estimator in EQ. I6.45l (known as the Barker estimator 11981 1 is not unique. An alternative is the virial 
estimator 11991 . Studies as to the relative efficiencies of these estimators have been made in 1 2001- 12031 . The general findings 
are that the more efficient of the two depends on the conditions under which they are used. For example in 1120 II it was found 
that for low temperature systems or systems in which the gradients of the configurational energies are high (which is when the 
quantum effects are typically more significant 12041 1 the Barker estimator is preferable. On the other hand it was found that 
the virial estimator is preferable at high temperatures or for systems with low gradients of the configurational energy (which 
is when the quantum effects tend to be less significant). In 12031 . other findings were also made, one being that as the number 
of replicas P were increased, the virial estimator eventually became more efficient than the Barker estimator. In this thesis we 
chose to use the Barker estimator, mainly due to its simplicity. 

The estimators that we have discussed thus far have all been based on finite replica approximations of the quantum partition 
function Eq. 16.111 As a result these estimates will have an associated systematic error (see Eq. I6.24> . If < O >p denotes the 
finite replica expectation of an operator O L205J and if < O >oo corresponds to the infinite replica estimate, then it follows 
M9\ that: 



Eq. l6.47l Drovides a clear prescription with which one may proceed to arrive at an estimate which is free of systematic error. 
In order to estimate the asymptotic limit < O >oo what one does is to plot a graph of < O >p versus Provided that P 

is sufficiently large, so that the corrections in Eq. 16.241 and Eq. I6.47l in which P is raised to a power higher than two may be 
neglected, the corresponding plot should yield a straight line graph whose intercept gives an estimate for < O >oo |206|. An 
illustration of this will be given in section l6r4l 

The partition function, with !K({r}) given by Eq. 16.261 corresponds to what is known as the primitive approximation (PA) 
in the Quantum Monte Carlo literature II166I - I169I . The widespread prevalence of the use of the PA in current literature is due 
to its underlying simplicity. By exploring more accurate decomposition schemes (section l6.2.6> to that used in Eq. 16.151 one 
may derive what we will refer to as the higher order approximants (HOA). These HOA methods are more accurate than their 
PA counterparts in that the error terms in equations of the form of Eq. I6.47l decav faster than 1 /P^. However they come at the 
expense of increased complexity and computational expenditure. In the next section we will briefly discuss the HOA method. 



<0>P=<0>oo +0(^) 



(6.47) 
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6.2.6 Higher Order Approximants 

The primitive approximation, leading to Eq. 16.271 with Jf given by Eq. 16.261 is so called due to the fact that the Trotter 
decomposition given in Eq. 16.151 are the simplest such breakups of the hamiltonian. Other breakups do exist, and may be 
written in the form I188I - I190I . 12071 . and are called the higher order approximants. These higher order approximants allow 
one to use a smaller number of replicas than the PA methods in order to achieve a desired level of accuracy 12081 . since the 
systematic error associated with these methods decay faster with increasing P. However these gains have to be appropriately 
balanced against the increased complexity and increased computational cost that accompanies their implementation. One such 
approximant is based on a Wigner-Kirkwood like expansion 11681 . II207I . 12091 - 121 11 ). For this method the systematic error 
in a finite replica approximation scales as -pj. The method is based on the identity II207I : 



(6.48) 



where 




(6.49) 



and: 



[A, B]= AB- BA 



(6.50) 



Application of this identity 12071 once again yields Eq. I6.23l and Eq. 16.271 where J{({r}) is now given by: 




(6.51) 



where: 




By appropriately differentiating Eq. I6.25l (with % given by Eq. I6.51> one may also obtain the estimator for the expectation of 



the kinetic, configurational, and total energies of the system: 



<T>p 



d\nZ 
dm 




+ 




(6.53) 
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<H>p = - 



1 dZp 



3PN 



P 



2/3 
(3^\ 



1 



4P2 



' < [V,;£(rO]' > +E" 



4=1 



(6.54) 



<£, >p = <:k >p - <f >p -E^ 



- pE£(rO + ^(f)^pE(^)^ (6-55) 



It can be shown 11891 . 12071 that within the HOA scheme, the finite repHca estimate of the expectation of an observable O, 
< O >p, will scale towards the asymptotic limit, < O >oo in the following way: 

< O >p=< O >oo +0(|^) (6.56) 

Similarly it is not hard to show that (Eq. I6.24> : 

Zp = Z[l + Oi^)] (6.57) 

As before we note that a finite replica estimate will have a systematic error associated with it. The extrapolation technique 
described in section l6.2.5l mav then be used to estimate the asymptotic value of the appropriate expectation. That is if one 
plots a graph of < O >p against 1 / P*, then the intercept of the graph on the vertical axis should, provided P is large enough, 
yield an estimate of < O >oc which is free of systematic errors. 

6.2.7 The classical limit 

In order to develop an intuition for the polymeric-hke system described by Eq. I6.25l it is instructive to observe the emergence 
of the classical limit out of Eq. I6.25l bv considering the interplay between the kinetic and the configurational actions 5'^ and 
Sv- To do this consider a simulation in which a sufficient number of replicas P have been employed so as to ensure that 
the quantum effects are modelled to the desired accuracy. Now consider increasing the temperature, so as to reduce r 12121 . 
The effect of this is to increase the strength of the spring constant -^j—p associated with the harmonic-like kinetic term Sk, 
resulting in increased rigidity of the polymers. This increased rigidity has two effects. The first is to make permutations, 
other than the identity permutation, increasingly unlikely. The second is to make the spatial arrangements of the particles 
of the various replicas increasingly similar (r^ w ri+i, so that ~ P^i^j ) where j denotes any replica). It is 

clear that what we are seeing is the emergence of classical behaviour, something we would expect on the transition to higher 
temperatures. That is in this limit Eq. I6.25l reduces to: 



(6.58) 
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where: 



1 3NR f 1 

G(ri) = (^-^)^ / (ir2dr3...drpexp{--— [(ri-r2)2 + (r2-r3)2 + ... + (rp-ri)2]} 



1 



(6.59) 



47rAg/3 J 

Eq. l6.6Ql is simply the classical partition function. 



(6.60) 



6.3 Quantum FEDs via the Path Integral Formalism 



By comparing Eq. l6.27l to Eq. 11.21 it is clear that the generalisation of the expression for the ratio of the partition functions 
(see Eq. ll.Sl and Eq. I2.13> so as to account for quantum effects is a trivial one (see 1381 . 11611 . lll71l - il78I V In this section 
we formulate the quantum version of the real space mapping (Q-RSM) and a quantum version of the fourier space mapping 
(Q-FSM) for estimating FEDs of phases. Though the Q-RSM is quite similar to its classical counterpart, the Q-FSM is quite 
different, and takes into account the harmonic interactions propagated by the intra-polymer (or inter-replica) interaction term 
Sk (Eq. I6.29t . In both formulations we will neglect exchange, and we wUl formulate both methods within the scheme of the 
PA. Generalisation to the case of HOA methods is straightforward, with the discretisation errors scaling as instead of 

1/P2. 

6.3.1 Quantum Real Space Mapping 

Limiting ourselves to the case of distinguishable particles, it is clear that if we have two phases A and B and we want to 
measure the quantum mechanical FEDs then from Eq. 16.271 (see also 1381 . 11611 . 11711 - 11781 ^ the ratio of the partition 
functions is simply determined by: 

RsA^e-''i^"--^"-^5lBA (6.61) 



where 12061 



and: 



31ba = lim 31ba,p (6.62) 



^BA.P^^'BAil + Oi^)] (6.63) 



^BAP = p p f (6.64) 

Jdri drpYlj^^AA[rj]exp{-J2,=i[jx-f(^i+^ - + r£(r^+i)]} 
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Following the derivation of Eq. l2.13l one may map the problem of determining the ratio of the integrals in Eq. 16.641 in which 
a single hamiltonian is employed, onto that of determining the ratio of two integrals in which the hamiltonians are different. 
To do this we express the position of the particles in terms of the displacements about some reference configuration R, which 
in the crystalline case is conveniently chosen to be the lattice sites: 

= R + (6.65) 
where the subscript i denotes the replica. It then follows that in the u representation the ratio in Ea. l6.64l mav be written as: 

Jdn, dupe-/^^^(W) 

^^^'^ - Jdn, dupe-/5«-(W) ^^-^^^ 

where: 

5{^({u}) - ^[ (u,+i - u,)' + r£^(u,+i)] (6.67) 

This mapping (which we call the quantum RSM, or Q-RSM) is one in which one simultaneously maps the displacements of 
the particles of each and every replica of phase 7 onto the corresponding replica of phase 7: 

Rj, Ui R^, Ui for all repUcas i (6.68) 

All the simulations performed in this chapter were implemented via the Q-RSM. The crucial feature of the Q-RSM is that 
since: 



— !— (r,+i -r,)2 = ^_(R-R + u,+i -u,)2 

4A5T AXqT 

1 ' 



4A,T 



(u,+i - u,)' (6.69) 



the kinetic part of the action 5*;^ is the same in the two phases under the operation of this mapping. This is a significant 
advantage of this particular mapping since in the large P limit the kinetic action Sk dominates over the configurational action 
Sv (see appendix!!}. 

Following the development of earlier chapters one may proceed to define a macrovariable which measures the energy cost 
of mapping the configuration of the polymeric system associated with phase A onto that of the polymeric system associated 
with phase B: 

Mba{M) = P['Kb{{u})~J{a{{u})] 
3 ^ 

= ^Y.^^B{Vii) - ^A{Vi^)) (6.70) 
i=l 



CHAPTER 6. QUANTUM FREE ENERGY DIFFERENCES 



148 



By comparing Eq. 16.661 to Eq. 12.21 it is immediately clear that the overlap identity (see Eq. I2.21> will hold even for our 
quantum system. If vr^ denotes the quantum sampling distribution of phase 7: 

^9=^^({u})=e-^=«-«"» (6.71) 

then it immediately follows that: 

J<-BA = ^TTT 7-qT (6-72) 

P(Mba\'^b) 

From Eq. 16.721 it clear that all the discussions of the previous chapters (with the exception of chapter |3j also apply to the 
problem of estimating the quantum mechanical FED's. In particular the vast array of estimators derived from Eq. I2.21l and the 
various extended sampling strategies used to overcome the overlap problem may also be used in the quantum case of Eq. 16.661 
Later on we will use the PS estimator in conjunction with the MH extended sampling strategy to estimate 31 ba for several 
different values of P (see section l6'.4.7> . However before we do this we will derive the quantum version of the FSM. UnUke 
the Q-RSM, the quantum FSM (Q-FSM) is considerably different in appearance from its classical counterpart. 

6.3.2 Quantum Fourier Space Mapping 

It is a straightforward exercise to re-write the expression in Eq. I6.66l in terms of some effective configuration v (see Eq. I2.5> . 
corresponding to a PM which matches the fourier coordinates of each and every replica: 

Rba = e-'3[^^^-^^:^l3?5^ (6.73) 

and 



'Rba= \^T[,[detSBAf^BA,p (6.74) 

or 



W = + (6.75) 



where this time: 



/dvi dvpe-'3^«(W) 

^^^'^ - /dvi dvpe-/5«-(W) (^-^^^ 

with: 

/3M^({v}) - ^[ (T^v,+i - T^v,)2 + t£^(v,+i)] (6-77) 

^AqT 

The relevant macrovariable which quantifies the cost of the mapping then generahses to: 

MBAiM) - P[^B{{TB^r}) ~ J{a({T^v})] (6.78) 
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Though perfectly valid, there are two problems with the PM as formulated in Eq. 16.761 The first is that the kinetic action 
gets modified under the corresponding mapping of configurations. That is even though such a FSM matches the harmonic 
contribution to the configurational energy of each and every replica, the fact that Sk gets modified means that on the transition 
to a large number of replicas the cost of making a PM will become energetically expensive, thereby reducing the overlap 
between the two phases. In this case even if the quantum system becomes harmonic at very low temperatures, the guarantee of 
curing the overlap problem in the harmonic limit will no longer exist. In fact since the harmonic inter-replica interactions get 
stronger for larger P and end up dominating the overall action (see appendixD, and since larger values of P will be needed at 
lower temperatures, it follows that the cost of the PM, as measured by Eq. 16.781 will become greater the lower the temperature. 

An alternative formulation reveals itself when we notice that the kinetic action in Eq. I6.28l is a quadratic function of the 
displacements u. Therefore if the system only explores the harmonic parts of the configurational energy £ then the overall 
action St will itself be a quadratic function of the displacements u. In this case it is possible to define a mapping with ensures 
perfect overlap between the two systems. The construction of the transformation follows a similar procedure to that used to 
derive the classical transformation S ba (see chapter|3|l. We start by expanding the action in Eq. I6.28l as a power series in the 
displacements to yield: 



2—1 ^ 

where K is the dynamical matrix (see Eq. 13.81 . We may then approximate the total action St by: 



(6.79) 



St = u^rJu 



(6.80) 



where 



U2 



V up ; 

and U(,,_i)(37v)_|_i = (u^)i where 77 denotes the replica (assuming the values I through to P) and i denotes the component 
(taking the value 1 through to 3N). It is not hard to show from Eq. I6.79l that: 

2 1 

^l{v-l){3N)+i]l{u-l){3N)+j] = ('^K.y + - -^^[Sr^l^-i) + (^,,(,.+1)] (6.81) 



Following the procedure employed in deriving the classical FSM (Eq. I3.16t . we may immediately write down the transforma- 
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tion matrix for the Q-FSM: 




(6.82) 



where now e™ and are the normalised 3NP component eigenvectors of il^ and fls respectively, /c™ are the associated 
eigenvalues, and the summation J2m '^^^^ non-null eigenvalues of fl, and where the indices i and j span the values 
through from 1 to 3NP. In this formulation the 'global' displacement vector of one phase, u"* say, is mapped onto that of the 
other phase via the relation: 

= S|j^u^ (6.83) 

such that the total actions of the two polymers are matched. This transformation may be conceptualised as the following 
mapping: 



The partition function is now given by: 



A ^ B: Ra ^ 'R.B u ^ S| ,u (6.84) 



Rba = e-^^^"^-^°^^5iBA (6.85) 

where 



JIba - lim [det S%a]^ba,p (6.86) 

where 

^ ^ Jdu? rfugcxp{-EL[^(uf^,-uf)^+r£B(uf^,)]} 

and where {u"*i} and {u^} are related via Eg. 16. 831 The transformation S^^ ensures that the quantity in Ea. l6.80l is identical 
for the two phases. 

Unlike the classical FSM, this quantum version of the FSM will not necessarily guarantee an improvement in the overlap 
as the temperature is reduced. The reason for this is that the presence of zero point motion means that the system may explore 
the anharmonic regions of the configurational energy even at T = 0. However in cases where the quantum effects (zero point 
motion) are not strong enough so as to force the particles to visit the strongly anharmonic regions, then the quantum FSM, as 
we have formulated here, might serve as a useful tool. 



CHAPTER 6. QUANTUM FREE ENERGY DIFFERENCES 



151 



6.4 Implementation details and simulation results 

6.4.1 Motivation 

An archetypal example of a solid in which quantum effects are important is ^TJe 11681 . In low density solid helium, the atoms 
have a large zero point motion due to the small atomic mass 12131 . Since the atomic interactomic configurational energy is 
relatively weak as compared to the zero point motion, the lattice expands due to the outward pressure arising from this zero 
point motion. The result of this is that the solid is destabilised at much lower densities than would be allowed classically 12141 . 
In addition to this one finds that for '^He the result of this zero point motion is that the particles, on average, sit at the relative 
maximum of a bimodal configurational energy 12131 . 12151 . 12161 . resulting in imaginary frequencies of the dynamical matrix, 
rendering the harmonic description inaccurate. As the density is increased, the configurational energy eventually comes to 
dominate the zero point motion, and as a result the crystal develops a single well configurational energy which localises the 
particles to their lattice sites, resulting in the harmonic description becoming accurate II213I . 

As noted before, in addition to zero point motion, a phenomena called exchange arises in the case of indistinguishable 
particles. In solid '^He this has little effect since the fact that the atoms have no spin to label them means that there is no direct 
consequence of exchange (see I168I V In the case of solid ^He the fact that the atoms have non-zero spin means that exchange 
plays an important role in determining the magnetic properties (see fT68l) of the solid. 

The phase diagram of ^iJe is shown in figure l63l '*iJe may, at the crudest level, be described by the Lennard-Jones (LJ) 
configurational energy, which provides a reasonable model for the rare gases 12181 . II219I . However in order to accurately 
determine the phase diagram one needs to employ more accurate configurational energies (see I2201 - I223I '). In the rest of this 
chapter we will concern ourselves with the fcc/hcp regimes of the phase diagram (as modelled by the LJ configurational en- 
ergy). Our goal will not be to provide definitive statements about the phase diagram but instead to investigate the methodology 
developed in this chapter and to get a qualitative feel for the effect of zero point motion on the relative stability of the fee (A) 
and hep (B) crystalline structures. For detailed studies of the quantum LJ solid, we refer the reader to 12241 - 12261 . 

6.4.2 The Model System 

As was the case in the classical simulations, the reduced density pa^ was set to be pa^ = 1.092. In addition to the parameters 
T* and pa^, which enter into the classical simulations (see section ll!2l . one also encounters the additional parameter in 
the case of the quantum simulations. For our LJ systems this was fixed through the De Boer parameter 12271 . 12281 D given 
by Eq. 16.371 In appendixlHlwe clarify the way in which the de Boer parameter enters into the calculations. 

Initially simulations employing the same systems as those described in section l3T2l were implemented in order to estimate 
the expectation of thermodynamic variables such as the kinetic energy and the configurational energy, with good accuracy 
being obtained with both the PA (Eq. I6.26t and the HOA (Eq. I6.51> schemes. However on attempting to estimate the FEDs, it 
was found that these systems were too large for us to handle with the available computational resources. This meant that we 
had to restrict our simulations to a system size of N=96, which, from the way the fee and hep unit cells were constructed (see 
figure l3^ . was the smallest system size that could be used. A consequence of this system size was that in order to fulfil the 
requirements imposed by the minimum image convention [58 1 particles could only interact with their first nearest neighbour 
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shell (comprising of 12 particles), so that — l.lr„„ (see section l3^ . 

6.4.3 Sampling the polymer 

In principle the simulation of a system whose hamiltonian 'K is described by Eq. l6.26l or Eq. I6.51l is a straightforward task. 
One merely performs single particle perturbations, and accordingly accept these moves with the acceptance probabilities given 
in Eq. 11.271 In practice however this is not an efficient way to sample this polymeric system. The origins of this lie in the 
inter-replica coupling term -^j—p (ri+i — r;)'^. As the temperature is lowered an increasing number of replicas will need to be 
employed in order to keep the systematic errors controlled at some prescribed level 12291 . A consequence of this is that the 
harmonic inter-replica coupling will get stiffer, resulting in the decrease of the average size of an accepted move. Roughly 
speaking we see that since the kinetic action is a gaussian like term with a prefactor which increases linearly with the number 
of replicas, the mean square displacement should roughly scale as l/\/P- This leads to a critical slowing down 11681 of the 
simulation in this limit, and severely hampers the simulation. 

In order to alleviate this problem, one must introduce an additional move to the single particle moves already present in 
the classical case. This move is a global polymer move 11681 in which one moves a whole polymer without changing its 
conformation. Such a move leaves unaffected the kinetic action Sk- As a consequence it is only the change in Sv which 
enters into the acceptance probability of such moves. Both moves are important; on the one hand the global-polymer moves 
allow faster exploration of the configurational energy £, whilst the single particle moves allow the different conformations of 
the polymer to be explored. For the simulations considered here, it was found that an implementation of the global polymer 
move for every P single particle moves was optimal, in the sense that the correlation of the underlying data was kept to a 
minimum. 

6.4.4 Testing the algorithm 

In order to check that the algorithm was functioning correctly, two separate checks were made on the simulation. In the first 
a harmonic configurational energy was constructed, and the analytic results for the mean total energy < H > were compared 
to that obtained by the simulation. In the second a LJ configurational energy was employed, and the parameters were adjusted 
so as to get the simulation to explore a region of the phase diagram in which the quantum effects were non-negligible and in 
which only the harmonic regions of the configurational energy were visited. In the latter case "JI-ba was also estimated via a 
MH-PS simulation and compared to the corresponding analytic result. 

6.4.4.1 Harmonic Potential 

In order to test our algorithm, we considered a system interacting via the interatomic configurational energy: 




(6.88) 



j e nn{i) i 
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where 



1 



-Krf 



(6.89) 



where nn{i) denotes the set of particles which interact with particle i. It is well known II 121 that for a solid interacting via a 
harmonic configurational energy, the mean total energy may be obtained exactly from: 



where: 



(6.90) 



1 



where uji — \/ ^ and where Xi are the eigenvalues of the dynamical matrix of £(r), given by: 



(6.91) 



SikSap X K X n{i) - K[l - (5ifc][^Q;3] : if fcenn(i) 

: otherwise 



(6.92) 



where ^ ■ denotes a summation over the modes, and where n{i) denotes the number of members in the set nn{i). 

Figure lOl shows a graph comparing the estimates of the expectation of the total energy as obtained by simulation with the 
theoretic values as predicted by classical and quantum statistical mechanics. The curve for the classical theory was obtained 
from the equipartition theorem |4l-||6j which states that for a system of particles interacting via a harmonic configurational 
energy, the expectation of the total excitation energy is given by: 



<H >=E" + (3iV - v)kT (6.93) 

where N is the number of particles (3N represents the number of degrees of freedom) and ly is the number of constraints. 
For simulations with periodic boundary conditions = 3. The curve corresponding to the theoretical predictions as made by 
quantum statistical mechanics was obtained from Eq. 16.901 

The first thing that one notices is that the results of the simulation are in complete agreement with the curve as extracted 
from Eq. 16.901 Comparing the quantum and classical curves, we notice from figure 16.41 that the quantum graph does, as 
expected, converge onto the classical line on the transition to sufficiently high temperatures. However at lower temperatures 
the situation is different. Here the mean total excitation energy levels off and assumes a constant value, corresponding to 
the term J^i'^^i arising in Eq. 16.901 This is the zero point energy of the system and arises from the inherent motion of the 
particles, present even at Kelvin. This is purely a quantum phenomenon and arises from the Heisenberg uncertainty principle. 
In contrast the mean total excitation energy vanishes in the classical limit, and results in the departure of the quantum system 
from the classical curve as seen in figure lOl 
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Figure 6.4: A plot of the analytic results versus those obtained by (HOA) simulation for the mean total excitation 
energy < H of the quantum system interacting via the harmonic conf igurational energy, Eq. 16.891 

As the temperature is increased the system becomes increasingly classical and hence the relationship between the total exci- 
tation energy and the temperature becomes linear, as predicted by the equipartition theorem (Eq. I6.93> . As the temperature is 
reduced to zero the quantum discretisation effects become increasingly important and the excitation energy becomes constant, 
due to the presence of zero point motion. It is for this reason that the graph for the quantum system departs from the classical 
line in the limit of low temperatures II230I . 
The range of temperatures were from kT=0.00001 to 0.0009. 
Xq = 0.5 X 10-9, K=1.0. 

As will all the harmonic calculations via Eq. 16.891 < iJ > is expressed in units of k. 
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6.4.4.2 Lennard Jones System in harmonic regime 



The test implemented in the previous section was also implemented on the LJ hep (B) and fee system (phase A). The param- 
eters T* and D were appropriately adjusted so as to ensure that only the harmonic parts of the configurational energy were 
visited. The eigenvalues of the dynamical matrix, obtained via numerical methods, were then used to determine the ana- 
lytic value for the mean total energy of the system, via Eq. 16.901 This was then compared to the result obtained by simulation 
and used to verify that the simulation was indeed visiting only the harmonic regions of the LJ configurational energy. In the 
next stage a check was made in order to verify that the simulation was indeed estimating the FEDs correctly. The (quantum 
mechanical) analytic values for 'JI'ba were obtained via the relation II 121 : 



3^. 



Za 



1 



-} 



(6.94) 



where cu^ denotes the frequency of the i-th mode of phase 7. These results were then compared to those obtained by a 
simulation employing the HOA scheme (section [6. 2. 6> using the MH-PS method (section l43t . The classical value was 
obtained from Eq. 13.191 The results have been tabulated below: 







^BA 


analytic: classical harmonic 


285 


0.810 


analytic: quantum harmonic 


2327.39 


0.671 


analytic: simulation 


2328.7 ±0.2 


0.678 ±0.006 



Table 6.1 : The values of JI-ba for a harmonic LJ quantum system 



D = 1.816 X 10-^ T*=0.005, P=20 



The first column of table l6.1l verifies that the system of particles, interacting via the LJ configurational energy, were indeed 
exploring only the harmonic regions of the LJ configurational energy, since the simulation results agree with the analytic values 
as predicted by harmonic theory (Ea l6.94> . The analytic values and the simulation results for H-ba (see column 2 in table 
16. U clearly agree to within the errors, and differ significantly from the classical value. It is clear the effect of the increased 
amplitudes of vibration arising from the zero point motion (see figure lOt act, in this regime of the phase diagram, favourably 
towards the fee (A) phase, making it more probable (relative to the hep (B) phase) than it would be in the classical case. This 
is expected since in the classical case 1351 the increasing amplitudes of vibration obtained on increasing the temperature also 
favours the fee (A) phase, within the harmonic regimes. 



6.4.5 Scaling of thermodynamic parameters with P 

In practice a PIMC simulation necessarily involves a finite number of replicas. Unlike in section 16.4.41 where we had an 
analytic check so as to allow us to determine whether a sufficient number of quantum replicas had been employed, one will 
not have an idea as to the magnitude of the systematic errors arising from the finite replica simulations in the general case. 
As mentioned before the only information available to us is the way the associated systematic errors scale with the number of 
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Figure 6.5: A plot of the distance of a particle from its lattice site, as averaged over all particles (harmonic quantum 

Lennard-Jones solid) 

u denotes the distance a particle from its lattice site, averaged over all particles of the system. Though the means of the 

distributions correspond to the mean displacement of a particle from its lattice site, the associate spreads of the displacement 

of a particles from its lattice sites are ViV = \/96 w 10 times wider than that shown in the figure above. 

It is clear that in the quantum case the zero point motion pushes the particles out to regions further from the lattice sites than 

would be the case classically. 

T* = 0.005, i) = 0.00001816. 
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P 

Figure 6.6: < H >p.7r« -E% versus -pj- (PA, harmonic) 
The three values of P chosen were P=20, 30, 40. 

The line offit was given by: <H>p .^i^^^ -£;^ = 1551.4 - 22353(l/p2). 

The analytic value was: < H >= i?^ + 1551.61. This agrees well with the intercept of 1551.4 (in units of k). 
kT = 0.0001, \g = 0.5 X 10-^ K=1.0. 



replicas. By plotting the desired expectation as an appropriate power of the inverse of the number of replicas one may hope to 
obtain the asymptotic value that the expectation assumes in the limit of an infinite number of replicas. 

Specifically we saw in sections [6.2.51 and 16 .2 . 61 that the expectations of an observable should have errors which scale as 
(for the PA) and 1 /P^ (for the HO A). In order to illustrate this scaling, simulations of the harmonic system as described 
by the configurational energy given in Eq. I6.88l and Eq. I6.89l were implemented via both the PA and the HOA methods. The 
estimates of the asymptotic limits were then extracted via the appropriate graphical extrapolation techniques, and the results 
were then compared to the analytical value of the total energy. Figure |6^ and figure [6J\ illustrates that the scaling of the 
expectation of the total energy does indeed follow that of Eq. 16.471 and Eq. 16. 5 61 for the PA and HOA methods respectively, 
yielding the correct value (as obtained analytically) in the asymptotic limit. 

Given that both methods accurately determine the asymptotic value of the mean total energy, the question now remains as 
to which yields a smaller error (for a given computational resource). In addressing this issue, we first note that the number of 
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Figure 6.7: < H >p.7r« -E% versus -pj (HOA, harmonic) 
The three replica values that were simulated corresponded to P=10, P=15, and P=20. 
The line of fit was given by: < H >p.-ni -E% = 1552.5 - 124630(1/P''). 

The graph shows a plot of < H >p,7r«, a function of -p^ for a system of particles (in phase B) interacting via the 

harmonic configurational energy (Eg. 16.89V The analytic value was < H >= Eg + 1551.6 (in units of k). 
kT = 0.0001, Aq = 0.5 X IQ-^, K=1.0. 
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replicas chosen in figure lSTSI and figure l6^ were the minimum needed in order to be in the appropriate scaling regimes. Below 
this range the mean total energies no longer scaled as and for the PA and HOA methods respectively. From the 

graphs it is evident that half the number of replicas were needed in the case of the HOA method than were required for the 
PA method. However it was also found that the HOA method took roughly double the amount of time to perform a given 
number of lattice sweeps, as compared to a PA simulation employing the same number of replicas. In order to understand this 
we note that since the higher order hamiltonian of Eq. I6.51l has the additional term [V£(r)]^ to the primitive one (Eq. I6.26> . 
and since this term has to be computed over the same number of nearest neighbours for each particle as one would have to do 
when calculating £(r), the HOA method will require roughly double the time to simulate. As a result we see that a 2P-replica 
PA simulation will take the same time as a P replica HOA simulation, thus offsetting any gains initially offered by the HOA 
method. 

All that is left to compare between the two methods is the correlation of the underlying data. Figure lOl shows the ratio of 
the error (Jho.p obtained in a P replica HO simulation to the error (Jpa.2P obtained from a 2P replica PA simulation, run for 
the same duration of time 12311 . Clearly figure lOl shows that in regards to correlations, the HO method has a slight advantage 
over the PA method, and for this reason we employed the HOA method in our attempts at estimating the FEDs. We also note 
that the trend of the graph indicates that this advantage increases as the number of replicas increases (for the systems studied 
here). 

6.4.6 Dependence of P{MabWb,pa) o" P ^ 

As we have seen before, the statistics of the macrovariable Mab essentially contains all the information of the FED between 
the two phases. Therefore it is instructive to examine the dependence of these distributions on P and T. Figure l6^ shows the 
quantum probability distribution P{Mab\i^'b pa) for different temperatures (and fixed replica number); figure lOol shows 
the classical distributions P{Mab\''^'b) for different temperatures; and in figure l6?T2l PfAf 4 rItt^, p^) is plotted (at a given 
temperature) for different numbers of replicas. 

The first thing that is clear is that for the quantum case the behaviour of P{Mab\t^'b pa) '^o'- monotonic with the 
variation of the relevant parameters. In figure l6^ the peak of the distributions initially move in towards the origin with an 
accompanied decrease of variance. However beyond a certain temperature (T ^ 0.1), the mean and variance of the associated 
distributions start to increase as the temperature is raised. 

The reason for this observed behaviour is as follows. At sufficiently low temperatures the zero point motion of the particles 
force the system to wander further away from the lattice sites (see figure than would be the case in the classical limit, 
resulting in the peak of the distribution being further away from the origin than would be expected in the classical case. This 
is clearly the case with T* = 0.01 (compare the graphs in figures lOl and figure l6?Tol i. As the temperature increases, the 
contribution to the energy of the zero point motion remains constant, since the typical energy has not yet reached that of the 
phonon excitation energies. From the definition given in Eq. 16. 701 it immediately follows that Mab must decrease. As the 
temperature is increased even further the thermal excitation contributions to the energy begin to become important. In this 
case the rate at which Mab decreases will itself reduce (since the decrease due to the division by T will be offset by the 
increase in -pX^ili b (u^) — £.a (ui)) , until eventually Mab starts to increase. On further increase the system will become 
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Figure 6.8: Comparisons of tine errors of a P replica HOA simulation to a 2P PA one 

For the systems studied here it was found that roughly twice as many replicas were needed for the PA method as compared to 
the HOA method in order to ensure that the simulation was in the appropriate scaling regime, so as to allow one to arrive at 
an estimate of observables, free of systematic errors, via the graphical extrapolation technique described in sections l6.2.5l and 
sections 16.2.61 However the HOA method took twice as long (as compared to a PA simulation employing the same number 
of replicas) to achieve a given number of sweeps, thereby offsetting the advantages just mentioned (since a 2P-replica PA 
simulation will take just as long as a P-replica HOA simulation). This graph shows the ratios of the errors of a 2P-replica PA 
simulation to those of a P-replica HOA simulation, and indicates that the HOA method has a marginal advantage over the PA 
method. The trend seems to be such that this advantage increases as P increases. 
D = 0.1816. T* = 1.5. 
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Figure 6.9: P{Mab\ttb,pa) for different T, P constant 
The probability distribution P{Mab\t^b pa) obtained for a selection of temperatures ranging from T* = 0.01 to T* = 
1.0 for the LJ configurational energy. 
r> = 0.001816, P=10. 
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Figure 6.1 0: P{MAB\Tr%) for various T* for tine classical simulations 
This figure shows the classical distributions P{Mab\t^b) corresponding to those of figure |6^ 
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Figure 6.11 : Temperature dependence of the mean distance of a particle from the lattice site 

a)T* = 0.01 b)T* = 0.05 c)T* = 0.1 d)r* = 0.2 e)r* = 0.5 f)T* = 1.0 

See figure l631 for an explanation of the way the mean displacement is calculated. 

The graph shows the average displacement of the particle in the quantum and classical limits, corresponding to the simulations 
shown in figures lOl and figure l6?Tol It is clear that at low temperatures, the effect of the zero point motion is important and 
results in a significantly greater mean displacement of the particle from its lattice site than in the corresponding classical case. 
This results in the peak of P{Mab {t^b pa) being positioned significantly further out from the origin than the corresponding 
classical distribution (see figure \63\ and figure l6?Tol . As the temperature increases, the zero point motion becomes less 
important, resulting in the gradual convergence of the two distributions. 
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Figure 6.12: P(Mab\ p^) versus P, for a constant T simulation 
The probability distribution P(M^B|7rp p^) was obtained for a range of replicas ranging from P=10 to P=130, with the 
temperature being kept fixed at T* = 0.4. 
L> = 0.1816. 
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classical and the (classical) thermal effects will mask the quantum zero point motion, at which point the difference between 
the graphs in figure l6T9l and figure l6?T0l arising from the effect of the zero point motion, will eventually be negligible. 

Figure l6T2l shows the probability distribution P{Mab\t^b pa) ^'^^ different numbers of replicas. Initially as the number 
of replicas increases the peak moves away from the origin (up to P ^ 20). Further increase in the number of replicas leads to 
the peak moving closer to the origin, converging (by P ^ 130) to the stationary distribution, which is ultimately positioned 
further away from the origin than the corresponding classical distribution. The important thing to note from figures 16.91 
and 16. 121 is that as quantum effects become increasingly important 12321 . the peak of P{Mab\''^'b) moves away from the 
origin (relative to the classical distribution) and its width increases. This means that in addition to the fact that the quantum 
simulation is inherently more time consuming, additional time must also be spent refining the sampling strategy (whether 
it be increased amounts of multicanonicalisation in the case of MUCA simulations or increased numbers of replicas being 
employed in the MH method) in order to estimate the FEDs. 

6.4.7 FEDs 

The primary motivation for developing the path integral machinery in the preceding sections was to use it to estimate quantum 
FEDs. In fact by formulating it in the way that was done in Eq. 16.661 we made available to ourselves the vast array of tools 
discussed in the previous chapters that are suitable for tackling this type of problem. In this final section we discuss our 
attempts at estimating the quantum mechanical FEDs. In estimating the FEDs, our aim was to investigate the role of the zero 
point motion on the relative stability of the two phases in a regime of the phase diagram in which both the quantum effects 
and the anharmonic effects were significant. Figure l6?T3l shows the estimates of the FEDs, obtained via the MH-PS method, 
in such a regime. 

It is clear from figure l6T3l that the quantum effects essentially act so as to favour the hep (B) phase (as compared to the 
classical case). This can be understood in the context of results obtained in the classical simulations. In 1351 the classical LJ 
system was studied and it was found that the increasing anharmonicity (obtained on increasing the temperature) favoured the 
hep (B) phase. This conclusion was arrived at by comparing the simulation results to the harmonic calculations. The same 
effect is likely to be the cause of the quantum effects favouring the hep (B) phase. That is the zero point motion pushes the 
particles into regions further out from the minimum of the configurational energy than they would typically explore in the 
classical case, making the system more anharmonic. As is the case in the classical systems, this anharmonicity acts in a way 
which favours the hep (B) phase. This is in sharp contrast to the harmonic regions of the quantum phase diagram (see section 
16.4. 4. 2> . where the increased amplitudes of vibration acts so as to favour the fee (A) phase (as is also the case in the classical 
regime, see lISSl l. 

6.5 Discussion 

The quantum Lennard- Jones phase diagram has not been determined yet via computational techniques and the work here 
represents a first step in developing the necessary machinery for a move in that direction. The factors limiting our investigation 
are the following: 
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Figure 6.13: CUba,? versus -p^ (HOA) for the LJ configurational energy 

The graph shows the plot of the quantity ^■sa,'P against for the LJ configurational energy. 
The asymptotic value extracted from the plot: 3is^,y = 0.9 ± 0.02. 
The classical value: 01-ba,7 = 0.844 ± 0.004. 
T* = 1.5. 
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1 . The slowing down associated with the simulation of a system of interacting polymers over that of a system of interacting 
particles. 

2. The increasing number (P) of replicas needed as the temperature is reduced. One not only has an increase in computa- 
tional costs due to the fact that one has to simulate more replicas, but also a critical slowing down associated with the 
increasing strength of the inter-replica forces. 

3. The need to perform graphical extrapolation in order to obtain a single estimate of the expectation of an observable. 

In order to accurately determine the phase diagram (using the same number of processors that we used) at the temperature we 
chose, one would need to employ a 12 x 12 x 12 system. Since the time, associated with keeping the error in the estimate 
of the FED at some prescribed level, scales roughly as (for short ranged interactions) we see that the simulation of a 
12 X 12 X 12 system would involve an increase in computational requirement of approximately [ g^^^^ ]*^ — 5832 times. In 
accordance with Moore's Law, this sort of computational power will be available to us in about 13 years. 

However a significant feature of the simulations the way that we have done it (i.e. via the MH route) is the enormous 
scope for parallelisation. This parallelisability does, in principle, allow us to determine the phase diagram accurately even 
today, simply by distributing the replicas amongst an increased set of processors \23^. In our simulation we employed 256 
processors. Therefore to perform the above calculations one would require 1.5 million processors. With the rapid expansion 
of parallel clusters (e.g. EPCC hpcx) the MH method should make the task of determining the quantum phase diagram a 
realistic project at a much earlier time than that predicted above. 



Chapter 7 



Conclusion 

In order to determine the location of a phase boundary between two phases one must determine at which point in the phase 
diagram the FED of the two phases is zero. The simplest approach is to tackle the problem via computational techniques 
(Monte Carlo) whereby one determines the weights of macrostates of one phase relative to those of the other. From this one 
may then infer the ratio of the partition functions of the two phases. 

The problem with this approach is that generally a simulation initiated in a given phase will not visit the regions of 
(absolute) configuration space associated with the other phase, since the two phase will in general be separated by a region 
of configuration space of intrinsically low probability. As a consequence one will not be able to determine the weights of 
macrostates of one phase relative to those of the other phase. This is generally referred to as the overlap problem. 

One way to circumvent this problem is to use the PM formalism j8| in which one directly maps the configurations of 
one phase onto those of the other phase. By choosing an 'intelligent' PM one may generate considerable overlap between 
the two phases. In constructing the PM there are two issues which one must give consideration to. The first is the choice 
of a reference configurations and the second is the choice of coordinate systems (v), or representation as we call it, 
with which one parameterises the displacements of the particles from the reference configuration. Since the PM matches the 
coordinates (v^ = v^), it is clear that the overlap is dependent upon both R and v. The simplest and most straightforward 
choice of the representation is that in which the coordinates are expressed in terms of the displacements u of the particles from 
the reference configuration R^. We call the associated mapping the RSM. Another possible choice is one in which the one 
parameterises the degrees of freedom in terms of fourier coordinates of the system. This we call the FSM. For the FSM one 
finds that, in the case of structurally ordered phases, the overlap problem vanishes as the harmonic limit is approached (see 
chapter|3}, provided that the reference configurations are chosen to be the ground state configuration (i.e. the lattice sites). 

Generally however the scope for refinement of the representation is limited, and one finds that the overlap problem persists. 
The second strategy that one naturally encounters is that of the estimator which one uses to determine the FED. The choice 
of the optimal estimator depends on the way in which the regions of (effective) configuration space associated with the two 
phases overlaps. In the case where they overlap in the manner shown in figure^j](a) the EP estimator (Eq. I2.26> . in which one 
performs a single simulation in phase A, yields an estimate which is free of systematic errors. In the case where they overlap 
as shown in figure 1411 (b) then one must use estimators which involve simulations in both phases. In this situation one may 
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either choose to use a phase constrained estimator in which one performs two simulations, one in each phase, and in which the 
non-negligible contributions come from the region of overlap (Eq. 14. 31 or Eq. I4.20l where G is appropriately chosen), or one 
may employ the PS estimator (Eq. I4.47> in which the sampling distribution actually switches phases. However the validity of 
the estimates arising from these estimators presupposes some form of overlap in the regions of (effective) configuration space 
that the two phases explore. In the most general case, however, there will not be any form of overlap and therefore, like the 
choice of representation, the scope for refinement of the estimator will be limit. 

The final part of the FED problem is that of the sampling strategy. In this case one refines the sampling distribution in 
order to engineer overlap. Broadly there are three generic sampling strategies that one may pursue. The first is the MUCA 
strategy, whereby one introduces corrections to the Boltzmann weights appearing in the acceptance probabilities so as to 
force the simulation to explore regions of (effective) configuration space outside those it would normally explore (using the 
canonical probability distribution). The second is the MH strategy, whereby one simulates several systems independently. By 
simulating a series of systems in such a way that they overlap in the regions of (effective) configuration space that they explore 
and which, taken together, connect the regions of configurations space associated with one phase to those regions associated 
with the other phase, one is able to determine the FED. The advantage of this method is that it is highly parallelisable. The 
final strategy is the EG method, whereby one performs non-equilibrium work on the system so as to force it from the regions 
of (effective) configuration space associated with one phase to those of the other. By ensuring that one performs work in 
a gradual II234I . as opposed to abrupt, fashion, one may generate arbitrary overlap between the two methods. The overall 
sampling strategy may also involve combinations of these methods (see section I53t . 

The key components in tackling the FED problem have been summarised in figure mi 
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Figure 7.1 : Overview of tine strategies involved in estimating tine FED 
The figure above summarises the key components involved in tackling the FED problem. Broadly speaking there are three 
generic strategies that appear in tackling the overlap problem: the choice of representation, the choice of estimator, and the 
choice of sampling strategy. Generally the scope for refinement of the representation and estimator is limited, and therefore 
in order to fully overcome the overlap problem one must refine the sampling strategy. 

The horizontal arrows indicate the direction in which one may generalise. That is the elementary (or zero equilibration FG, 
Eq. I2.13> formulation may be generalised into the MH one (Eq. I5.14t . Likewise the MH formulation itself (by defining a 
non-equilibrium process (see section 1^.4. 8> on the hamiltonians "Ka and "Kb) may be generalised so as to be incorporated 
within the FG formulation, Eq. 12.1011 



Appendix A 

Proof of the fluctuation theorem 



In this section we set out to prove the fluctuation theorem as given by Eq. 12.1011 The original proof was given in L73J : we 
rederive it for the sake of mathematical clarity. The proof that is given here is the particular case of that given in 1731 in which 
A changes discontinuously from Ai to A2 at time ti, A2 to A3 at time t2, etc for the A ^ B process (and the reverse in the 
B ^ A process). 



A.1 Proof of the fluctuation theorem 

We start by deriving a result which is central to the whole procedure. Consider a simulation employing the Metropolis 
algorithm in which the sequence of configurations {ai, (T2, (T3, ...} is generated. Under the scheme of the metropolis algorithm 
the probability of the system going from ctj to ai+i is given by Psi^i ct^+i) (see Ea. ll.20> . where Ps{o'i <^i+i) satisfies 
the condition of detailed balance (Ea. ll.l9t : 

where Tr{ai) is the underlying sampling distribution. Eq. lA.ll mav be easily extended to the case of two non-consecutive 
configurations: 



/d-l d-1 
[l[Ps{<J, ^ <Jj+i)][l[da 



J = l 
7r(crrf) 



aj) X 



7r(CTi) 



d-l 
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7r(CTi) 



Ps{(^d CTi) 



(A.2) 



or: 



Ps{(^d CTi) 7r(cri) 



172 



(A.3) 



APPENDIX A. PROOF OF THE FLUCTUATION THEOREM 



173 



We may now proceed to prove the fluctuation theorem. Suppose that P4^s({v}|v(l)) denotes the probability of obtaining 
a path {v} given an initial configuration of v(l) in the A ^ B process. Then since the initial configuration v(l) is sampled 
from the distribution tt^, it follows that the distribution of the paths is given by: 

J'A^sdv}) = ^^(v(1))P^^b({v}|v(1)) (A.4) 

where: 



Pa^s({v}|v(1)) = F5(v(l)^v(2)|7rA,)xFs(v(2)^v(3)|^A3)-. 

P5(v(n-2)^v(n-l)|7rA„_J (A.5) 

where Ps{v{i) — > v(i + l)|7rAj_|_i) denotes the probability of the system making a transition from the configuration v(i) to 
v(i + 1), between times ti and t^+i, under the sampling distribution . Similarly the path {v} for the B A process is 
sampled from the distribution y%^A. where: 

y%^A{M) = 'r|(v(n - l))FB^^({v}|v(n - 1)) (A.6) 

and where: 



PB^.l({v}|v(n - 1)) = Ps(v(n - 1) ^ v(n - 2)|7r;,_J x Fs(v(n - 2) ^ w{n - 3)|7rA„. J.... 

xPs(v(2) ^v(l)|7rAj (A.7) 



Since from Eq. IA.3 



it foUows that: 



Ps(v(z) ^ v(^ + _ 



= ^ e-^^^^i-(--i)) ^M-P&,.^(-(^ + 1)) - i^.,,,(v(^))]} 

i—1 

= ^ e~P^^r.i^ir.-i)) ^M-P[Ex.-AHn - 1)) - i?2(v(l))]} 



(A.8) 



j=2 

„ n-1 

/exp{/3^[i?,^^,(v(z))-i?,.(v(z))]} (A.9) 
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or: 



Z|.ibM1 = |£ eMWBAiM)} (A.10) 

Defining: 

P(WBA\Tr'A) = J SiWBA-WsAiMWA^BiM) (A.H) 

and similarly for P{Wba\tt%). It follows from Ea. FA.lOl that: 

PiWBA*\n^A) - JsiWBA*-WBA{M)m^B{M) 

= ^e'^--'P{WBA*\7r%) (A.12) 

^A 

or: 

P(W^BAki) = |^e^--*P(P^sA|7r^) (A.13) 
Za 

It is important to note that in this derivation we relied on Eq. IA.3I As a consequence it is essential that the equilibration time 
used to evolve v(i) to v(i + 1) in the A ^ B process is the same as that used to equilibrate v(i + 1) to v(i) in the B ^ A 
process. This is consistent with the interpretation of the B ^ A process as being a time-reversal of the A ^ B process, in 
which the initial configurations are sampled from the distribution tt^ instead of tt^. 
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Fourier Space Mapping witli periodic 
boundary conditions 



This section primarily deals with the modification that must be made to the FSM (Eq. I2.14> in the case where systems with 
periodic boundary conditions are employed, and is relevant to the discussion of section lT.4.11 

Generally the employment of periodic boundary conditions (in conjunction with a pairwise configurational energy) means 
that there wiU be three eigenvectors of the dynamical matrix K which will be of zero eigenvalue. These eigenvectors corre- 
spond to translations of the system. Clearly the fact that they are of zero eigenvalue means that they cannot be incorporated 
into the framework of Eq. I3.9l and Eq. 13.111 Suppose that e^, e^, and ei^ correspond to the null eigenvectors. In this case we 
may express the displacements in terms of the fourier coordinates most generally as follows: 



3A^ e" 



m— 1 m— 4 V 7 

where al^, ai, and are some arbitrary constants, which are associated with the transformation (see Eq. 12. 5> : 



1 , 1 



- (a^ei , a^e^, a^^, ^e^, -^ef) (B.2) 



The ratio of the partition functions may then be written as: 



jU3 h.3N 
7 \ 7 



_ /due-^^-(") _ {nt4^}{/nt4rf-.e-/'^«(v)} 

where C^-y arises from the centre of mass contributions. Since these should not contribute to the ratio of the partition functions 
'Ji'RA we set: 

C^-y = 1 (B.4) 



175 



APPENDIX B. FOURIER SPACE MAPPING WITH PERIODIC BOUNDARY CONDITIONS 



176 



The redundancy of the translational degrees of freedom means that one may omit their consideration altogether in mapping 
the configurations of one phase onto those the other That is the transformation (Eq. 13. 9> may be replaced by a 3N by 
3N-3 column vector which is given by: 



7 



+3 



(B.5) 



' ^-v) / j * 

1,4 1.5 1.3N 

7 \ 1 \ 7 



(B.6) 



The transformation T-y now acts on the (3N-3) column vector v where the component Vi is given by Vi+^. The displacements 
u are then given by: 



u = T^v 



(B.7) 



or 



3JV 



u= Y,', 



m=A V 7 

Likewise the inverse transformation [(3N-3 by 3N) transformation] may be written as: 



or 



(B.8) 



(B.9) 



(B.IO) 



\ • / 

It then follows that the transformation S ba (Eq. I2.14t . which maps the configurations of one phase onto those of the other, 
may be written as: 



3iV 



k 



m=4 
3N 
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m=4 V B 



(B.ll) 
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From Eq. IB.3l we see that the FED may then be written as Eq. 13. 181 where now: 



AF^. =— yin(^) (B.12) 



and 



where 



= \ (B.14) 



Eq. IB. 121 is simply obtained from Eq. IB.3l bv noting that in the harmonic limit £a(v) = £s(v), so that the configurational 
integrals in the numerator and denominator exactly cancel out in this limit. 

The transformation S in Eq. IB.l II through the mapping in Eq. 12.141 ensures that the effective configuration v are 
preserved in mapping the displacements u of one phase onto those of the other 12351 . 



Appendix C 

Perturbation theory for the Fourier Space 
Mapping 



A probability distribution may be completely characterised by its cumulants (eq. I3.24> . Therefore an alternative way to 
investigate the dependence of the overlap on some generic parameter (like the temperature T) is to find the dependence of 
the cumulants on this parameter In this appendix we will specifically focus on the FSM, and we will find relations which 
determine the way the various cumulants of P{AlBA\'^!y) scale with temperature. The primary conclusions of this section will 
be that in the limit of T ^ all the cumulants vanish. The discussion of this appendix is relevant to section U. 5. II 



Let us define the harmonic average of a macrovariable M (v): 

<M>h= -' ^ ^ (C.l) 

where the limits of integration are implicitly assumed to be from — oo to oo. Two results which we will frequently use in this 
appendix and appendixiDlare the following: 



C.1 Preliminary Mathematical Properties of Gaussian Integrals 




(C.2) 



and 




if any of the qi are odd 



otherwise 



(C.3) 



where q — Yll=i1i ^i^^ where c is some constant. 
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For the sake of notational simplicity, we will, in the following section, denote M;y^(v) by Mps when v corresponds to fourier 
coordinates of the system (defined by the relation in Eq. 12.51 when is given by Eq. I3.9l l and Af;,^(v) by Mrs when 
working with the RSM (v = u). 



C.2 Temperature scaling properties of Un 

In the case of crystalline solids a physical motivation exists for the separation of the harmonic contributions to the excitation 
energy from the anharmonic ones. Let us start by first considering the Taylor expansion of the excitation energy £^ (Eq. \2.% 
in terms of the fourier coordinates v: 



ijk ijkl 

= H2 + + Hj + ... (C.4) 

where denotes the summation of all the terms of order n and where ^v^.v corresponds to the harmonic contributions to 
the excitation energy (see Eq. I3.14t . The expectation of a macrovariable M with respect to the sampling distribution tt!^ may 
then be written as: 

< M >,c= ^ / dvAf(v)e-'5(^^+^?+^?+-) (C.5) 
J 

where the partition function Zj may be expanded in the following way: 



= J dve-f'"^ [1 - - /3H2 + ^[H^,r + P'0{a') + pOia')] 

= Zh[l-f3<H2>h+Y<iH^f>h+0{f3^)] (C.6) 

In Eq. IC.6l we have used the fact (see Eq. IC.3> that the integrals of integrands whose overall order of the fourier coordinates 
{v} is odd vanishes. The 0(/3^) terms in Eq. IC.6l is what is left over on integrating the 0^O{a^) and f30{a^) terms. It then 
follows that: 

^ = ^[1 + [[3<H2 >H < >h] + 0(r')] (C.7) 

Since [(3 < Hj >h ~^ < {H^f >h] ~ and since we are only interested in the leading order terms in the temperature 
in Eq. IC.5l as /3 oo, we see that one may replace l/Z^ appearing in the expectations of Eq. IC.SI with 1/Zh in this limit. 
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One may then rewrite Eq. IC.5l as: 



< M >^c« ^ / dvM{v)G{v)e-'^"' (C.8) 

^ J 

where 

G(v) = 1 - pH] + ^[H^]' - PHJ + P'Oia^) + /30(a<') (C.9) 

In order to analyse the cumulants of P{MBA\'^'y) let us consider the particular case when M = Mb a, which we denote by 
Mps- From Ea. lC.4l we see that one may write; 

Mfs = P[SH^ + 6H2 + ...] (C.IO) 

where 

SH,, - HZ - HZ 

The expectation of an arbitrary power of the overlap parameter may then be written as: 

< M^s >-^^«< G{v)f3"{dH3 + SH4 + ...)" >h (C.ll) 

In evaluating the expectation in Eq. IC.lll one will obtain a series of terms scaling in different ways with respect to the 
temperature. Since we are examining the harmonic limit (T 0), we are only interested in the terms which are lowest order 
in T (i.e. highest order in /3). These originate from the integrals with the lowest overall (even) order of v. This means that both 
SH^ and SH4 need to be considered in evaluating the expectation of Eq. IC.l II since, depending on whether n is even or not, 
it might be either 6H3 or dH4 which couple to the lowest order terms of G'(v) so as to yield the most slowly vanishing term. 
Writing Eq. IC.lll in full and retaining only the lowest order terms, one finds that: 

< M^s >-^~ /3" < ['^^^3]" + [SHsr-^SH^ + PH^iSH^r + [SHsr-^SH^PH^ >h (C.12) 

so that: 

< Af^5 >^c- /3" < [5H:iY >^c~ /3-tif n is even (C.13) 

< >^c- /?" < [5H:iY'-^5HA >^e < (SH^iSHs)" >^c- /3-^if nis odd (C.14) 
From these relations we see that the mean and the variance of the overlap parameter scale in the following way: 

lim < Afi^s >7r-oc /3"^ (C.15) 
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and 

lim < M],s >^c - < Mfs >^cCx (C.16) 

It is clear from Eq. IC. 131 and lC. 141 that ujn will scale as < Mpg >7r^, since all other terms in the cumulants will either scale 
with the same or higher power of T. Therefore we conclude that for the FSM the cumulants of P{Mps\TT!y) will scale in the 
following way: 

lim LUn ^ if n is even (C.17) 

/3— ►oo 

In + l] 

lim w„ ~ /3 2 if n is odd (CIS) 

/3— »oo 



Appendix D 

Perturbation theory for the Real Space 
Mapping 

As in appendixlO we examine the temperature dependence of the overlap, as engineered by the RSM, through an investigation 
of the cumulants (Eq. I3.24t . We will derive exact expressions for the mean and variances of P{M=y^ | tt^ ), followed by a general 
argument to show that ti>„ tends to a constant non-zero value in the limit of T ^ 0. The material in this appendix is relevant 
to the discussion of section l3'.5.2l 

D.1 Low temperature limit of uji 

Let collectively denote the displacements of the particles of phase 7 from the reference configuration (which for the 
systems employed here correspond to the lattice sites of the crystalline solid, see section l3T2t . and suppose that [v-^] ; denotes 
the i-th component of the fourier coordinates of phase 7. The RSM ensures that; 

U;y = (Dl) 
Using Eq. I2.5l we see that this constraint imposes the following relation between and : 
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j 



E- 



■"7 ^7 L ^ 7J*Ti 



A[el]^.e7[v,]„ 



so that : 



(D.2) 



I m m 



Urn / Um 



[ef .e7][ef .e7][v,]™[v^]™ 



(D.3) 



From this we see that in the harmonic limit (where the excitation energy is given by Eq. I3.14> AI^^, which we write as Mjis 
to signify the fact that we as using the RSM, may be written as: 



fv^Wv^ (D.4) 



where the matrix elements of W are given by: 



WmA = - ^rhml (D.5) 

/ Urn / Urn 

We will discard the subscript 7 on the variable v-^ since we will assume (for the rest of this section) that v = v^. 

Because W is symmetric (i.e. Hermitian), we may diagonalise it. Typical diagonahzation routines yield eigenvectors, 
which, in the case of degenerate eigenvalues, may not be orthogonal. In this case one may employ the Gram Schmidt or- 
thogonalisation procedure to construct an orthonormal set amongst these degenerate eigenvectors. Suppose that W has the 
eigenvalues {Ki} and suppose that we write b = Nv where N^WN is diagonal. Since N is an orthogonal transformation, 
we see that det = 1 and b^.b = a^.a. Then: 



(v^.W.v) = (b'^(N^.W.N)b) 



(D.6) 



and therefore 
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< Mrs >h = P< v'^.W.v >h 



Alternatively, by directly appealing to Eq. ID.3l and Eg ID. 41 it is not hard to see that: 



j I ^ 

where we have used the fact that < [v-y\m[^i]m >h— ^mm^^^ (see Eq. IC.2t . From Eq. ID. Sl it immediately follows the first 
cumulant for the RSM may then be written as: 

D.2 Low temperature limit of uj2 

In a similar manner, the low temperature limit of the variance of Mrs may be calculated. Using the fact that: 

(v^.W.v)2 = (b^(N^.W.N)b)2 

i 

we see that: 



J — OO 

i 

JE3«?+ E (D.ll) 



Therefore we conclude that: 



< Mis >h-< Mrs >l 



< (v^.W.v)2 >^ _ < 



.W.v 



>2 



(D.12) 
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Accordingly the second cumulant for the RSM may be written as: 



^2 - \Y.^^ (D.13) 



D.3 Temperature scaling of Ur. 



Even though in the preceding section we were able to calculate the exact limiting form of uji and as T 0, extending this 
to the case of a;„ becomes tedious. Instead we will follow the presentation in section lCT^ in order to derive the scaling relation 
of ojn for the general case. In calculating the temperature scaling properties of ojn we first note that for the RSM the harmonic 
terms H2 are not identical in the two phases. Therefore the perturbation expansion of Mrs becomes (as compared with Eq. 



'3 

so that: 



Mrs ~ I3[SH] + SH^ + ....] (D.14) 



<M'^s>n « /3"<G(v)(<5i/2 + 5i^3 + ....)">/. 

= (3"^ <{l-(]H^ + (^[H^]^-pH2 + ...){SH2 + SH^ + ....r>h (D.15) 

It is immediately clear from Eq. ID.lSl that the temperature scaling properties of Mrs will be governed by the leading order 
term in eq. ID. 141 SH2, so that: 

< >.c^ 0(1) (D.16) 
From this we may infer that the cumulants will, for sufficiently low temperatures, be independent of the temperature: 

lim uj„ - 0(1) (D.17) 

13— ^00 

Therefore in the harmonic limit the distribution of Mj^, for the RSM, assumes a stationary form which is not that of the 
limiting form associated with perfect overlap (Eq. 13. U . 



Appendix E 

Determining Statistical Errors 



In this section we discuss the blocking method, which is a way to determine the error associated with an estimate of the 
expectation of an arbitrary macrovariable Q (see Eq. I1.31> obtained from correlated data. We also illustrate the way in which 
this blocking method may be used to estimate the error in the FED estimate. For more detailed information on the blocking 
method we refer the reader to fTsl . 12361 . 12371 . 

E.1 Errors of averages 

Suppose that we make a series of measurements Qi (i=l,...,t), sampled from a probability distribution of mean /i and variance 
. Suppose that it is also our deske to obtain an unbiased estimate for /x. This can be most simply obtained from the mean of 
the data set {Qi}: 

1 * 

^^-Y,Q^ (E.l) 

i=l 

In the case where successive measurements Qi are independent, one finds 11201 . 12381 that the distribution of fi tends to a 
Normal Distribution with mean /i and variance /t. That is P{jl) ^ N{iJL,a^ /t). This is simply a consequence of the 
central limit theorem. Therefore in the particular case where the measurements are independent, the "error" associated with 
the estimate of Eq. IE. H is simply given by a j^ft, where is an unbiased estimator of the variance of Q, and is given by: 

a2 = ^^(Q,-A)' (E.2) 

i=\ 

In the case where the measurements Qi are correlated, Eq. IE. II still yields an unbiased estimate for the mean of the underlying 
distribution of Q. However the associated error is now no longer given by a j \/t. One method for finding the associated error 
is the so called blocking method. In this method, the set of data {Qi} is sectioned into M blocks each containing m data 
entries. That is block i corresponds to the set (5m(i-i)+2j •■•■j '3m(i-i)+m}- Then for each block an estimate 
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jlmi'i) for the mean of the distribution of Q is made, and is given by: 

^ m 

i^m{i) = —^Qmii-l)+j (E.3) 

From this we obtain a set of block estimates for the mean of the distribution: 

{/i™(l),A™(2),....,Am(M)} 

Since: 

1 *^ 

i=l 

it follows that the block estimates {/im(i)} will themselves be distributed with a mean given by /i and a variance given by af^^, 
say. The key observation is that for sufficiently large blocksizes successive jlm{i) will independent. In this case the error of 
the average of the block estimates is simply given by ct^j/M. Since the estimator jl is precisely this average (see Eq. IE.4> it 
follows that, for the (sufficiently large m) regimes where successive block estimates are independent, the error in the estimate 
/t is given by (t(/x), where: 



mat 



(E.5) 



and where ai, is obtained from: 



;^-2 

a^ 



1 

— ^(A™W-A)' (E.6) 



1=1 



In order to find the regime of blocksizes where successive block estimates become independent, a simple graphical procedure 
may be used to estimate the errors in /t. Since mcr,^^ is constant in the regime where the jim (i) are independent (a result which 
must hold true since a^ (/i) is independent of the blocksize), we see that we may determine the blocksizes m for which the 
/tm(i) are independent simply by plotting a graph of toct^ versus m. As m increases the graph will eventually plateau off, 
indicating that the block estimates are indeed uncorrelated. From Ea. lE.5l we see that one may then use the value of the plateau 
(P) to determine the error in the estimate of jl: 



error in /t = 




(E.7) 



For a more mathematically rigorous treatment of the blocking procedure we refer the reader to 12361 . 
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E.2 Errors in the free energy difference 

Though one may estimate the FED by taking an appropriate expectation (Eq. I2.26> . the general expression for the FED will 
involve the ratio of expectations (see for example Eq. 11.81 Eq. 12.891 and Eq. I2.105> . In this case the average of the block 
estimates of JI-ba is not the same as the estimate of JI-ba obtained from the whole data set. In this section we show that the 
blocking method can also be used to estimate the error in ^-ba- As a specific example we use the PS estimator, in which the 
ratio of the partition functions is estimated by determining the (unbiased) ratio of the times spent in the two phases (see Eq. 
I2.89t . For simplicity we consider the case where no weights are employed. 

Suppose that we make a series of measurements -fi (i=l,....,t) of the "phase label" 7 which can take on either of two values, 
A or B say, during the course of PS simulation. Then the estimator for the probability of being in phase A is given by: 

1 * 

p(7 ^A) = -5]<5^.,A (E.8) 

i=l 

and the estimator for the probability of being in phase B is given by: 

p(7 = B) = 1 - p(7 = A) (E.9) 
It is clear from Eq. 12. 891 that In JI-ba (which is proportional to the FED) may then be estimated by: 



, / Time in B , 

D'-^> = In ) 

^ Time in A ^ 

- ln( P^^ = ^\) 



(E.IO) 



If p(7 = B) is deviated from its true value ^(7 = _B) by a small amount as {cb/p{i = B) « 1), then it is easy to show, 
using the approximation ln(l + x) ~ x valid for small x, that the error 6D^^^ in D^^^ is given by: 



^^ln( P^V^) J = (E.11) 

Now let us consider dividing the data of t observations into M blocks, each containing m data points. We may then make a 
block estimate of In 'R'ba ■ 



where: 



D„(z)^ln(-i4^il^^) (E.12) 
1 -pm[^^ = ^) 



^ mi 

p^{i,j^B)^— S^^^B (E.13) 

J— m(i— 1) + 1 
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The question that we now want to ask is how the block estimates Dm{i) in Eq. IE.12l mav be used to determine the error in the 
estimate given in Eq. IE. 101 We note that strictly the blocking procedure will only yield the error of the arithmetic average of 
the block quantities. That is the blocking procedure will estimate the error for the quantity D^'^\ which is estimated by: 

i 

where D'^'^'^ also represents an estimator of ln3ls.Ai though it is not an unbiased estimator We will now show (a result which 
is expected) that in the large t limit, the distinction between l)^^^ and D^'^^ vanishes, so that we may estimate the error in Eq. 
IE. 101 simply by using Eq. IE. 61 and Eq. IE.7l in which jlm{i) = D„i{i) and in which jj. is replaced by D^^\ To see this suppose 
thatpTO(*,7 = B) fluctuates about the true value p(7 — B) by an amount aB{i,m): 



Pm{i,j ^ B) ^ p{j = B) + aB{i,m) (E.15) 

It is clear from Eq. lE.SI that for sufficiently large blocksizes , cts (z, m) will be normally distributed with mean and variance 
cr^(m) say, asii, m) ~ N{0, (jg{ni)). Then it follows that: 



M^p{-f = -p(7 = B)) 
= I>^^^ + -, -, r-y"crs(i,m) (E.16) 



Using the fact that: 



\Y^nB{i,m)\ ^ VM<JB{m) (E.17) 

i 

and using the fact that for sufficiently large blocksizes m: 



cr^M 

m 



(E.18) 



we see that: 



1^(2) _£)(!) I VMasim) _1_ 

'"^ ' Mp.(7 = S)(l-p.(7 = S)) ^ ^ 



so that the distinction between the two estimators vanishes for sufficiently large t. 



Appendix F 



The overlap parameter and the Fermi 
function estimator 



In this appendix we bring out a relation that exists between the fermi function (FF) method and the overlap parameter O (Eq. 
I2.18> . We start off by noting that in the case of arbitrary switching FG the overlap parameter may be generalised to: 



' 77 



so that: 



P{w.-,-,\^-) 

dW=^^ jTr— (F. 1 ) 

1 + %^e^^^ 



0=< ^ >^c (F2) 



< f{WBA~Wrn)>^^^=^d (F.3) 



and 



< f{-[WBA - W^]) (F.4) 

It is immediately apparent (from Eq. IF.2t that knowledge of the overlap O translates to direct knowledge of JI'ba 12391 . The 
point is that a-priori knowledge of iKsyi (or Wm) is not at hand so as to allow an estimation of O via Eq. IF. 2 1 Consider the 
case where an equal number of independent samples are obtained in each phase, so that = ng. What the FF method does 
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is to start off with an estimate of OI-ba, say 'JIba = (where C is the estimate of Eq. 14.301 1, and use this to obtain separate 
estimates of the overlap O from simulations performed in one of the two phases: 

Oa - 2<f(WBA-C)>^.^ 

= 2 < >^c (F.5) 

I + JIbac^^^ 

and 



Ob = 2< f{-[WBA-C])>^^^ 
1 

= 2 < , —- >^o (F.6) 

'^BA 

Only if OIba is an unbiased estimator for JI-ba will the two estimates of Oa and Ob converge to the same value. Therefore 
what Bennett's recursive prescription (Eq. 14.3 II and Eq. I4.32t does is to vary ones estimate of JIba (through C) until the 
estimates of the overlap Oa and Ob have converged to the same value. At this point one can be sure that the estimate JIba 
reflects the true value of JI-ba since: 



■BA 



. < fiWBA-C)>^^^ 
< f{-[WBA-C]) >,c 



Ob 



^BA 



(F.7) 



Appendix G 

Multihamiltonian method as a limiting 
form of tlie Fast Growth method 

The MH method can be viewed as a limiting form of the FG method. The key insight is the observation that as the equilibration 
time At increases: 

At^oo P{R4ba\U) ^ PiMBAlTTx,) (G.l) 

where we recall that P{MBA\ti) denotes the probability distribution of Mb a at time ti, when the configurational energy has 
been incremented from E\^_-^ to and after the system has been equilibrated with tt^ for a time At. This stems from the 
fact that if one perturbs the configurational energy from £Ai_i (v(i — 1)) to £.\. (v(i — 1)), and then equilibrates the system for 
an infinite amount of time, to a configuration v(i), then the ensemble of configurations {v(i)} will be Boltzmann distributed 
with distribution tt^ . In other words one finds that in the case of adiabatic equilibration J'^^^[v(l), v(2), ...v(n — 1)] 
assumes the simple form: 

n-l 

n^i3[v(l),v(2),...v(n-l)]cx He-'^'^'^"^^" (G-2) 

i=l 

This is exactly the sampling distribution of the MH method (see Eq. I5.15> . Therefore the MH method can be viewed (for 
a given AA) as a limiting case of the FG method in which the equilibration time is infinite (i.e. adiabatic equilibration). 
Figure IgITI shows how the distribution P{W^~f\TTg) (of the FG method) tends to the limiting form of the distribution of the 
MH method as the equilibration time (At) is increased, and clearly illustrates further the connection between the MH and FG 
methods that we have just described. 

In regards to systematic errors, it follows that if one uses the EP estimator, then the systematic errors associated with the 
MH method will be less than or equal to those of the FG method since adiabatic equilibration translates to minimum systematic 
errors. However in the case of the PS method this statement no longer holds if both methods have sufficient overlap so as to 
ensure that the phase switches can take place. In this case both methods have zero systematic errors since they both visit all 
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Figure G.1 : P(W^BA|7r^) for the MH and the FG methods 



For the MH method Wba= Mb a- The figure compares the distributions obtained via the FG method, as the equilibration 
time At is increased, with that of the MH method. We see that as increasing equilibration (At) is allowed between successive 
work increments SWBA,i (for the FG method, Eq. I2.93> . the peaks and widths of the probability distribution reduce, tending 
asymptotically to the form assumed by the MH method. The number of replicas (n) was 10. 
T* = 1.0, RSM. 



the important regions of (effective) configuration space which contribute to the estimate of JI-ba- 

Apart from the issue of systematic errors, another difference of the two methods is the way in which they are realised. In 
the FG method one performs work on a single system as described in section l2'.4.8l In the MH method, one makes use of the 
form of Eq. IG.2I which allows one to realise the Wba distribution by performing independent simulations in parallel. This is 
a significant difference in that it allows considerable speedup of the task of evaluating the FED since one may parallelise the 
process. This will become especially apparent in chapter|6lwhen we apply the method to the study of quantum FEDs. 



Appendix H 

Details of the quantum simulations for the 
Lennard-Jones potential 

In this section we clarify the way the different parameters that enter into the calculations of the hamiltonian of the polymeric 
systems for the PA (Eq. I6.26> and the HOA (Eq. I6.51> . For simplicity we will work in the r representation. We recap that 
for the case of distinguishable quantum particles the PIMC simulation involves the simulation of a system with a partition 
function given by: 

Z = ydri....dr„(^^)^exp{-/3J{({r})} (H.l) 

where: 

/^^({r}) - E^[r«+i " r^]' + ^E^w (H.2) 

i=l ^ 1=1 

In the case of the Lennard-Jones configurational energy the distances are measured as units of a. Suppose that the super- 
script ^ over a variable denotes the fact that it is expressed in units of a, so that f = r/cr. Then we may conveniently 
express all quantities in terms of these scaled variables. Suppose that x^'^^ denotes the k-th coordinate of a particle, so that 

= ix,y,z). If: 

Sij = \J {xt - XjY + (yj - yjY + {z^ - zj)^ (H.3) 
and if £{sij) denotes the contribution to the overall configurational energy of the interaction between particle i and j, then it 
follows that: 



£.{sij) 



Sij Si^ 
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where G{sij) is a dimensionless number. If we denote by /3* the quantity (3* = then it follows that in the case of the PA, 
Eq. IH.2l mav be written as: 

^n{r}) = E^[f^+i - + f^E^Ifd (H.5) 



where: 



2 

kl 

(i) 

and where s j./ denotes the distance between particles k and 1 in replica i. 
Furthermore: 



ox] °v *y 

- {-)'H{S,,) (H.7) 
cr 

where H{fij) is also a dimensionless function. Therefore if the HOA is used then it follows that: 
where H[ri] is given by: 

H[r.,]^Y.H{S^:h) (H.9) 

In using these quantities in the simulation, one must appropriately modify these equations so as to take into account the fact 
that particles only interact with only the first nearest neighbour shell. 



Appendix I 

Interplay between kinetic and 
configurational actions 

It is well known in the Path Integral Monte Carlo literature 11681 that on the transition to a large number of rephcas, the 
kinetic action Sk dominates over the configurational action Sy- Note however this does not mean that the configurational 
action may be neglected on the transition to large number of replicas, since Sy essentially determines where in configuration 
space the polymer resides in, where as 5*;^ controls the magnitude of fluctuations between adjacent rephcas within this region 
of configuration space. 

In this appendix we provide a simple numerical illustration of this for the LJ systems employed in chapter|6] Figure IlD 
shows the dependence of < Sy >n% ™d < >w« on the number of replicas P for a simulation at a fixed temperature. 
For small number of replicas < Sk >tt1 starts off assuming a lower value than < Sy >Tr9 ■ As the number of replicas 
increase both < Sk >n% < Sy >tt''^ increase, until eventually Sk comes to dominate over Sy. Within this regime 

Sk scales linearly with P. < Sy on the other hand, had a positive gradient which decreases as P increases, but never 

quite reaches zero. As a result < Sy appears to plateau off, though the plateau is only reached in the P ^ oo limit. 

The figure clearly illustrates the dominance of Sk over Sy in the large P limit. 
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Figure 1.1 : Variation of < Sy and < St >,r% for a (PA) simulation in which the temperature is fixed and 

the number of replicas is varied 

The temperature was fixed at T* = 0.4 and the number of repUcas employed was varied between the values of P=10 and 
P=130. 

D = 0.1816, pa^ = 1.092, Q-RSM. 
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[127] For the range of temperatures shown in figure ITSl the RSM was also able to switch between the phases without appeal 
to any form of extended sampling. However since the RSM has a low temperature stationary limit and since the mean 
of P{Mba\'^^) scales as N whereas the spread (standard deviation) only scales as \/N it is clear that for sufficiently 
large system sizes the RSM will not be able to switch phases without the aid of extended sampling, irrespective of how 
low the temperature is. This is in sharp contrast to the FSM, which is guaranteed, by construction, to be able to switch 
phases at sufficiently low temperatures without resorting to extended sampling. 
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[128] It is clear that above temperatures of T=0.001 cumulants other than the first two also begin to become important in the 
expansion of the FED (Eg. 13. 241 . Since from these scale as or faster (Eg. 13.291 . one might expect that FED will 
also scale as or faster so as to make a linear fitting inaccurate. However the point is that the anharmonicity that 
appears in the distribution P{Mab Itt^) (leading to the departure from the ideal limit, Eg. l3.1> do not give an accurate 
indication of the amount of anharmonicity present in the system; but merely the additional anharmonicity induced 
under the operation of the transformation. In the calculation of the FED there are large cancellations in the cumulant 
expansion (see Eg. l3.28t . leading to a linear scaling of well beyond the regimes where the first two cumulants 

scale as jS^^. 

[129] As the third cumulant begins to become important in Eg. 13.281 the estimate of AFg^^ based on the first two cumulants 
in Eq. 13.301 has to become negative in order to compensate and maintain the approximately zero FED estimate of 
the FSM-PS method. To see this we note that from Eg. 13.291 we see that the term 2 < M^.y >^c in Eg. 13. 271 scales 

as /3~'^, 3 < My~f >7r^< >t^c scales as /3^^, and < >7rc scales as Therefore on increasing the 

temperature the term 2 < M^-^ >^c will eventually become the most dominant, since it increases most rapidly with 
increasing temperature. Since < M^^ >^c > we see that on increasing the temperature uj^ eventually becomes 
positive. Therefore the second cumulant uj2 must increases faster than uji so as to make the estimate in Eq. 13.301 
negative in order to compensate for this, so that the estimate including the third cumulant is approximately zero. 

[130] The reason for this is that, as was mentioned in section 12.4.21 regions which contribute most significantly to the nu- 
merator of Eg. l2.38l (or strictly the corresponding estimator for a simulation performed in phase B) will not be visited. 
As a result < e~*^^^ >7r|j will be underestimated, and as a consequence its use to estimate AFjJ^ will result in an 
overestimate. This is clearly what is observed in figure lT9l 

[131] The multicanonical sampling distribution was constructed so as to contain both P{Mab\'^a) -PI-^^abIti'b)- See 
section l2'.4.3l and section l2'.4.7l 

[132] We recap that in the case of the PS method MUCA sampling distribution must ensure that the Mab ^ regions are 
accessible to the simulation irrespective of which phase it is in. This means that one must explicitly construct MUCA 
weights so as to bias the regions of Mab space in between the two peaks (see figure IZ2l i. Therefore the closer the 
peaks, the narrower the region of Mab space that will need to be reweighted. We also note that as soon as the Mab ^ 
regions are accessible to the sampling distributions tt^ and tt^, then extended sampling will not be necessary for the 
PS method. However on the onset of this happening, extended sampling will still be needed for the EP estimator since 
for this method the sampling distribution only naturally samples one phase, and one must therefore /orce the simulation 
into the regions associated with the conjugate phase. 

[133] Or more generally between P{Wba\t^^a) P{Wba\''^b) ™ '^he case of arbitrary equilibration EG simulations, of 
which the zero equilibration simulations are a particular case (see section l2'.4.8> . 

[134] N. Lu, J. K. Singh, and D. A. Kofke. Appropriate methods to combine forward and reverse free-energy perturbation 
averages. Journal of Chemical Physics, 1 18(7):2977-2984, February 2003. 

[135] N. Lu and D. A. Kofke. Accuracy of free-energy perturbation calculations in molecular simulation, i. modelling. 

Journal of Chemical Physics, 1 14(17):7303-731 1, May 2001. 

[136] N. Lu and D. A. Kofke. Accuracy of free-energy perturbation calculations in molecular simulation, ii. heuristics. 
Journal of Chemical Physics, 1 15(15):6866-6875, October 2001 . 

[137] N. Lu and D. A. Kofke. Optimal intermediates in staged free energy calculations. Journal of Chemical Physics, 
1 1 1(10):4414-4423, September 1999. 

[138] In actual fact the N particle system is a system of Nh-1 particles in which one particle does not interact with the others. 

[139] The statistical errors in a finite time Monte Carlo simulation will depend on the choice of the estimator, since differ- 
ent estimators focus their emphasis on difference regions of Wba space. Under the phase constrained distributions 
P{Wba\t^a) ^^'^ P{Wba\t^b) these regions are visited with different probabilities, and hence the statistics of the 
'most important' regions will vary for the different estimators. 

[140] M. R. Shirts, E. Bair, G. Hooker, and V. S. Pande. Equilibrium free energies from nonequilibrium measurements using 
maximum-Ukehhood methods. Physical Review Letters, 91(14): 140601, October 2003. 
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Eg. lTBl 




where: 



and where the paths Yt = r(t) have the constraint that r(i) ~ r(0). 
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^ ' ^ ' 12 dfi^ 24 dn 
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